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Preface 


The revolt against the ancient world view of a universe centred upon the earth, 
which was initiated by Copernicus and further developed by Kepler, Galileo and 
Newton, reached its natural termination in Einstein’s theories of relativity. 
Starting from the concept that there exists a unique privileged observer of the 
cosmos, namely man himself, natural philosophy has journeyed to the opposite 
pole and now accepts as a fundamental principle that all observers are equivalent, 
in the sense that each can explain the behaviour of the cosmos by application of 
the same set of natural laws. Another line of thought whose complete 
development takes place within the context of special relativity is that pioneered 
by Maxwell, electromagnetic field theory. Indeed, since the Lorentz transform- 
ation equations upon which the special theory is based constitute none other than 
the transformation group under which Maxwell’s equations remain of invariant 
form, the relativistic expression of these equations discovered by Minkowski is 
more natural than Maxwell’s. In the history of natural philosophy, therefore, 
relativity theory represents the culmination of three centuries of mathematical 
modelling of the macroscopic physical world; it stands at the end of an era and is a 
magnificent and fitting memorial to the golden age of mathematical physics which 
came to an end at the time of the First World War. Einstein’s triumph was also his 
tragedy; although he was inspired to create a masterpiece, this proved to be a 
monument to the past and its very perfection a barrier to future development. 
Thus, although all the implications of the general theory have not yet been 
uncovered, the barrenness of Einstein's later explorations indicates that the 
growth areas of mathematical physics lie elsewhere, presumably in the fecund soil 
of quantum and elementary-particle theory. 

Nevertheless, relativity theory, especially the special form, provides a found- 
ation upon which all later developments have been constructed and it seems 
destined to continue in this role for a long time yet. A thorough knowledge of its 
elements is accordingly a prerequisite for all students who wish to understand 
contemporary theories of the physical world and possibly to contribute to their 
expansion. This being universally recognized, university courses in applied 
mathematics and mathematical physics commonly include an introductory 
course in the subject at the undergraduate level, usually in the second and third 
years, but occasionally even in the first year. This book has been written to 
provide a suitable supporting text for such courses. The author has taught this 
type of class for the past twenty-five years and has become very familiar with the 
difficulties regularly experienced by students when they first study this subject; 
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the identification of these perplexities and their careful resolution has therefore 
been one of my main aims when preparing this account. To assist the student 
further in mastering the subject, I have collected together a large number of 
exercises and these will be found at the end of each chapter; most have been set as 
course work or in examinations for my own classes and, 1 think, cover almost all 
aspects normally treated at this level. It is hoped, therefore, that the book will also 
prove helpful to lecturers as a source of problems for setting in exercise classes. 

When preparing my plan for the development of the subject, 1 decided to 
disregard completely the historical order of evolution of the ideas and to present 
these in the most natural logical and didactic manner possible. In the case of a 
fully established (and, indeed, venerable) theory, any other arrangement for an 
introductory text is unjustifiable As a consequence, many facets of the subject 
which were at the centre of attention during the early years of its evolution have 
been relegated to the exercises or omitted entirely. For example, details of the 
seminal Michelson-Morley experiment and its associated calculations have not 
been included. Although this event was the spark which ignited the relativistic 
tinder, it is now apparent that this was an historical accident and that, being 
implicit in Maxwell's principles of electromagnetism, it was inevitable that the 
special theory would be formulated near the turn of the century. Neither is the 
experiment any longer to be regarded as a crucial test of the theory, since the 
theory’s manifold implications for all branches of physics have provided 
countless other checks, all of which have told in its favour. The early controversies 
attending the birth of relativity theory are. however, of great human interest and 
students who wish to follow these are referred to the books by Clark, Hoffmann 
and Lanczos listed in the Bibliography at the end of this book. 

A curious feature of the history of the special theory is the persistence of certain 
paradoxes which arose shortly after it was first propounded by Einstein and 
which were largely disposed of at that time. In spite of this, they are rediscovered 
every decade or so and editors of popular scientific periodicals (and occasionally, 
and more reprehensibly, serious research journals) seem happy to provide space 
in which these old battles can be refought, thus generating a good deal of 
acrimony on all sides (and, presumably, improving circulation). The source of the 
paradoxes is invariably a failure to appreciate that the special theory is restricted 
in its validity to inertial frames of reference or an inability to jettison the 
Newtonian concept of a unique ordering of events in time. Complete books based 
on these misconceptions have been published by authors who should know 
better, thus giving students the unfortunate impression that the consistency of 
this system of ideas is still in doubt. 1 have therefore felt it necessary to mention 
some of these ‘paradoxes’ at appropriate points in the text and to indicate how 
they are resolved; others have been used as a basis for exercises, providing 
excellent practice for the student to train himself to think relativist ically. 

Much of the text was originally published in 1962 under the title An 
Introduction to Tensor Calculus and Relativity. All these sections have been 
thoroughly revised in the light of my teaching experience, one or two sections 
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have been discarded as containing material which has proved to be of little 
importance for an understanding of the basics (e g. relative tensors) and a number 
of new sections have been added (e.g. equations of motion of an elastic fluid, black 
holes, gravitational waves, and a more detailed account of the relationship 
between the metric and aftine connections). But the main improvement is the 
addition of a chapter covering the application of the general theory tocosmology. 
As a result of the great strides made in the development of optical and, 
particularly, radio astronomy during the last twenty years, cosmological science 
has moved towards the centre of interest for physics and very few university 
courses in the general theory now fail to include lectures in this area. 

It is a common (and desirable) practice to provide separate courses in the special 
and general theories, the special being covered in the second or third under- 
graduate year and the general in the final year of the undergraduate course or the 
first year of a postgraduate course The book has been arranged with this in mind 
and the first four chapters form a complete unit, suitable for reading by students 
who may not progress to the general theory. Such students need not be burdened 
with the general theory of tensors and Riemannian spaces, but can acquire a 
mastery of the principles of the special theory using only the unsophisticated tool 
of Cartesian tensors in Euclidean (or quasi-Euclidean) space. In my experience, 
even students who intend to take a course in the general theory also benefit from 
exposure to the special theory in this form, since it enables them to concentrate 
upon the difficulties of the relativity principles and not to be distracted by 
avoidable complexities of notation. 1 have no sympathy with the teacher who, 
encouraged by the shallow values of the times, regards it as a virtue that his 
lectures exhibit his own present mastery of the subject rather than his 
appreciation of his students’ bewilderment on being led into unfamiliar territory. 
All students should, in any case, be aware of the simpler form the theory of tensors 
assumes when the transformation group is restricted to be orthogonal. 

As a consequence of my decision to develop the special theory within the 
context of Cartesian tensors, it was necessary to reduce the special relativistic 
metric to Pythagorean form by the introduction of either purely imaginary 
spatial coordinates or a purely imaginary time coordinate for an event. 1 have 
followed Minkowski and put x 4 = ict; thus, the metric has necessarily been taken 
in the form 

ds 2 = dx, 2 +dx 2 2 +dx 3 2 + dx 4 2 = dx 2 +d>' 2 +dz 2 -c 2 df 2 

and ds has the dimension of length. I have retained this definition of the interval 
between two events observed from a freely falling frame in the general theory; this 
not only avoids confusion but, in the weak-field approximation, permits the 
distinction between covariant and contravariant components of a tensor to be 
eliminated by the introduction of an imaginary time. A disadvantage is that ds is 
imaginary for timelike intervals and the interval parameter s accordingly takes 
imaginary values along the world-line of any material body. Thus, when writing 
down the equations for the geodesic world-line of a freely falling body, it is 
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usually convenient to replaces by t, defined by the equation s = ict, z being called 
the proper time and dr the proper time interval. However, it is understood 
throughout the exposition of the general theory that the metric tensor for space- 
time y,j is such that ds 2 = g i} dx‘ dx J ; a consequence is that the cosmical constant 
term in Einstein’s equation of gravitation has a sign opposite to that taken by 
some authors. 

References in the text are made by author and year and have been collected 
together at the end of the book. 


D. F. Lawden 


Department of Mathematics, 

The University of Aston in Birmingham. 
May, 1981. 
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CHAPTER 1 


Special Principle of Relativity. Lorentz 
Transformations 


1. Newton's laws of motion 

A proper appreciation of the physical content of Newton’s three laws of motion is 
an essential prerequisite for any study of the special theory of relativity. It will be 
shown that these laws are in accordance with the fundamental principle upon 
which the theory is based and thus they will also serve as a convenient 
introduction to this principle. 

The first law states that any particle which is not subjected to forces moves along 
a straight line at constant speed. Since the motion of a particle can only be specified 
relative to some coordinate frame of reference, this statement has meaning only 
when the reference frame to be employed when observing the particle’s motion 
has been indicated. Also, since the concept of force has not, at this point, received 
a definition, it will be necessary to explain how we are to judge when a particle is 
'not subjected to forces’. It will be taken as an observed fact that if rectangular 
axes are taken with their origin at the centre of the sun and these axes do not 
rotate relative to the most distant objects known to astronomy, viz. the 
extragalactic nebulae, then the motions of the neighbouring stars relative to this 
frame are very nearly uniform. The departure from uniformity can reasonably be 
accounted for as due to the influence of the stars upon one another and the 
evidence available suggests very strongly that if the motion of a body in a region 
infinitely remote from all other bodies could be observed, then its motion would 
always prove to be uniform relative to our reference frame irrespective of the 
manner in which the motion was initiated. 

We shall accordingly regard the first law as asserting that, in a region of space 
remote from all other matter and empty save for a single test particle, a reference 
frame can be defined relative to which the particle will always have a uniform 
motion. Such a frame will be referred to as an inertial frame. An example of such 
an inertial frame which is conveniently employed when discussing the motions of 
bodies within the solar system has been described above. However, if S is any 
inertial frame and S is another frame whose axes are always parallel to those of S 
but whose origin moves with a constant velocity u relative to S, then S also is 
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inertial. For, if v, v are the velocities of the test particle relative to S, S respectively, 
then 


v 


V - u 


( 1 . 1 ) 


and, since v is always constant, so is v. It follows, therefore, that a frame whose 
origin is at the earth’s centre and whose axes do not rotate relative to the stars can, 
for most practical purposes, be looked upon as an inertial frame, for the motion of 
the earth relative to the sun is very nearly uniform over periods of time which are 
normally the subject of dynamical calculations. In fact, since the earth’s rotation 
is slow by ordinary standards, a frame which is fixed in this body can also be 
treated as approximately inertial and this assumption will only lead to 
appreciable errors when motions over relatively long periods of time are being 
investigated, e.g. Foucault’s pendulum, long-range gunnery calculations. A frame 
attached to a non-rotating spaceship, whose rocket motor is inoperative and 
which is moving in a negligible gravitational field (e.g. in interstellar space), 
provides another example of an inertial frame. Since the stars of our galaxy move 
uniformly relative to one another over very long periods of time, the frames 
attached to them will all be inertial provided they do not rotate relative to the 
other galaxies. 

Having established an inertial frame, if it is found by observation that a particle 
does not have a uniform motion relative to the frame, the lack of uniformity is 
attributed to the action of a force which is exerted upon the particle by some 
agency. For example, the orbits of the planets are considered to be curved on 
account of the force of gravitational attraction exerted upon these bodies by the 
sun and when a beam of charged particles is observed to be deflected when a bar 
magnet is brought into the vicinity, this phenomenon is understood to be due to 
the magnetic forces which are supposed to act upon the particles. If v is the 
particle’s velocity relative to the frame at any instant t, its acceleration a = dv/d r 
will be non-zero if the particle’s motion is not uniform and this quantity is 
accordingly a convenient measure of the applied force f. We take, therefore. 


where m is a constant of proportionality which depends upon the particle and is 
termed its mass. The definition of the mass of a particle will be given almost 
immediately when it arises quite naturally out of the third law of motion. 
Equation (1.2) is essentially a definition of force relative to an inertial frame and is 
referred to as the second law of motion. It is sometimes convenient to employ a 
non-inertial frame in dynamical calculations, in which case a body which is in 
uniform motion relative to an inertial frame and is therefore subject to no forces, 
will nonetheless have an acceleration in the non-inertial frame. By equation (1.2), 
to this acceleration there corresponds a force, but this will not be attributable to 
any obvious agency and is therefore usually referred to as a ’fictitious’ force. Well- 
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known examples of such forces are the centrifugal and Coriolis forces associated 
with frames which are in uniform rotation relative to an inertial frame, e g. a 
frame rotating with the earth. By introducing such ‘fictitious’ forces, the second 
law of motion becomes applicable in all reference frames. Such forces are called 
inertial forces (see Section 44). 

According to the third law of motion, when two particles P and Q interact so as 
to influence one another’s motion , the force exerted by P on Q is equal to that 
exerted by Q on P but is in the opposite sense. Defining the momentum of a particle 
relative to a reference frame as the product of its mass and its velocity, it is proved 
in elementary textbooks that the second and third laws taken together imply that 
the sum of the momenta of any two particles involved in a collision is conserved. 
Thus, if m t ,m 2 are the masses of two such particles and u,, u 2 are their respective 
velocities immediately before the collision and v,, v 2 are their respective velocities 
immediately afterwards, then 

m, u, + m 2 u 2 = m, v, + m 2 v 2 (1.3) 


'• e — (u 2 -v 2 ) = V, -u, (1.4) 

m, 

This last equation implies that the vectors u 2 - v 2 , v, - u, are parallel, a result 
which has been checked experimentally and which constitutes the physical 
content of the third law. However, equation (1.4) shows that the third law is also, 
in part, a specification of how the mass of a particle is to be measured and hence 
provides a definition for this quantity. For 


= h -«i | 

m, jtt 2 v 2 1 


(1.5) 


and hence the ratio of the masses of two particles can be found from the results of 
a collision experiment. If, then, one particular particle is chosen to have unit mass 
(e.g. the standard kilogramme), the masses of all other particles can, in principle, 
be determined by permitting them to collide with this standard and then 
employing equation (1.5). 


2. Covariance of the laws of motion 

It has been shown in the previous section that the second and third laws are 
essentially definitions of the physical quantities force and mass relative to a given 
reference frame. In this section, we shall examine whether these definitions lead to 
different results when different inertial frames are employed. 

Consider first the definition of mass. If the collision between the particles m,, 
m 2 is observed from the inertial frame S, let u , , u 2 be the particle velocities before 
the collision and v,, v 2 the corresponding velocities after the collision. By 
equation (1.1), 


u, = u, -u. 


etc. 


(2.1) 
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and hence 

V, -u, =V|-u„ D 2 - v 2 = u 2 — v 2 (2.2) 

It follows that if the vectors v, - u,, u 2 — v 2 are parallel, so are the vectors v, — u,, 
u 2 — v 2 and consequently that, in so far as the third law is experimentally 
verifiable, it is valid in all inertial frames if it is valid in one. Now let m { ,m 2 be the 
particle masses as measured in S. Then, by equation (1.5), 

= jv, ~»i| = h ~Utl _ 2 3) 

m, |u 2 — v 2 | |u 2 -v 2 | m. 

But, if the first particle is the unit standard, then m, = m, = 1 and hence 

m 2 = m 2 (2.4) 

i.e. the mass of a particle has the same value in all inertial frames. We can express 
this by saying that mass is an invariant relative to transformations between 
inertial frames. 

By differentiating equation (1.1) with respect to the time /, since u is 
constant it is found that 

a = a (2.5) 

where a, a are the accelerations of a particle relative to S, S respectively. Hence, by 
the second law (1.2), since m = m, it follows that 

T=f (2.6) 

i.e. the force acting upon a particle is independent of the inertial frame in which it 
is measured. 

It has therefore been shown that equations (1.2), (1.4) take precisely the same 
form in the two frames, S, S, it being understood that mass, acceleration and force 
are independent of the frame and that velocity is transformed in accordance with 
equation (1.1). When equations preserve their form upon transformation from 
one reference frame to another, they are said to be covariant with respect to such a 
transformation. Newton’s laws of motion are covariant with respect to a 
transformation between inertial frames. 

3. Special principle of relativity 

The special principle of relativity asserts that all physical laws are covariant with 
respect to a transformation between inertial frames. This implies that all observers 
moving uniformly relative to one another and employing inertial frames will be in 
agreement concerning the statement of physical laws. No such observer, 
therefore, can regard himself as being in a special relationship to the universe not 
shared by any other observer employing an inertial frame; there are no privileged 
observers. When man believed himself to be at the centre of creation both 
physically and spiritually, a principle such as that we have just enunciated would 
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have been rejected as absurd. However, the revolution in attitude to our physical 
environment initiated by Copernicus has proceeded so far that today the 
principle is accepted as eminently reasonable and very strong evidence contradict- 
ing the principle would have to be discovered to disturb it as the foundation upon 
which theoretical physics is based. It is this principle which guarantees that 
observers inhabiting distant planets, belonging to stars whose motions may be 
very different from that of our own sun, will nevertheless be able to explain their 
local physical phenomena by application of the same physical laws we use 
ourselves. 

It has been shown already that Newton’s laws of motion obey the principle. Let 
us now transfer our attention to another set of fundamental laws governing non- 
mechanical phenomena, viz. Maxwell’s laws of electrodynamics. These are more 
complex than the laws of Newton and are most conveniently expressed by the 
equations 

curl E = — cB/ct (3.1) 

curl H = j + c’D/f’r (3.2) 

div D = p (3.3) 

div B = 0 (3.4) 

where E, H are the electric and magnetic intensities respectively, D is the 
displacement, B is the magnetic induction, j is the current density and p is the 
charge density (SI units arc being used). Experiment confirms that these 
equations are valid when any inertial frame is employed. The most famous such 
experiment was that carried out by Michelson and Morley, who verified that the 
velocity of propagation of light waves in any direction is always measured to be 
c ( = 3 x 10 H m s' 1 ) relative to an apparatus stationary on the earth. As is well 
known, light has an electromagnetic character and this result is predicted by 
equations (3. 1 > -(3.4). However, the velocity of the earth in its orbit at any time 
differs from its velocity six months later by twice the orbital velocity, viz. 60 km/s 
and thus, by taking measurements of the velocity of light relative to the earth on 
two days separated by this period of time and showing them to be equal, it is 
possible to confirm that Maxwell’s equations conform to the special principle of 
relativity. This is effectively what Michelson and Morley did. However, this 
interpretation of the results of their experiment was not accepted immediately, 
since it was thought that electromagnetic phenomena were supported by a 
medium called the aether and that Maxwell’s equations would prove to be valid 
only in an inertial frame stationary in this medium, i.e. the special principle of 
relativity was denied for electromagnetic phenomena. It was supposed that an 
‘aether wind’ would blow through an inertial frame not at rest in the aether and 
that this would have a disturbing effect on the propagation of electromagnetic 
disturbances through the medium, in the same way that a wind in the atmosphere 
affects the spread of sound waves. In such a frame. Maxwell’s equations would (it 
was surmised) need correction by the inclusion of terms involving the wind 
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velocity That this would imply that terrestrial electrical machinery would behave 
differently in winter and summer does not appear to have raised any doubts! 

After Michelson and Morley's experiment, a long controversy ensued and, 
though this is of great historical interest, it will not be recounted in this book. The 
special principle is now firmly established and is accepted on the grounds that the 
conclusions which may be deduced from it are everywhere found to be in 
conformity with experiment and also because it is felt to possess a priori a high 
degree of plausibility. A description of the steps by which it ultimately came to be 
appreciated that the principle was of quite general application would therefore be 
superfluous in an introductory text. It is, however, essential for our future 
development of the theory to understand the prime difficulty preventing an early 
acceptance of the idea that the electromagnetic laws are in conformity with the 
special principle. 

Consider the two inertial frames S, S. Suppose that an observer employing S 
measures the velocity of a light pulse and finds it to be e. If the velocity of the same 
light pulse is measured by an observer employing the frame S, let this be c. Then, 
by equation (1.1), 

c = c -u (3.5) 

and it is clear that, in general, the magnitudes of the vectors c, c will be different. It 
appears to follow, therefore, that either Maxwell’s equations (3.1)— (3.4) must be 
modified, or the special principle of relativity abandoned for electromagnetic 
phenomena. Attempts were made (e.g. by Ritz) to modify Maxwell’s equations, 
but certain consequences of the modified equations could not be confirmed 
experimentally. Since the special principle was always found to be valid, the only 
remaining alternative was to reject equation (1.1) and to replace it by another in 
conformity with the experimental result that the speed of light is the same in all 
inertial frames. As will be shown in the next section, this can only be done at the 
expense of a radical revision of our intuitive ideas concerning the nature of space 
and time and this was very understandably strongly resisted. 

4. Lorentz transformations. Minkowski space-time 

The argument of this section will be founded on the following three postulates: 

Postulate I. A particle free to move under no forces has constant velocity in 
any inertial frame. 

Postulate 2. The speed of light relative to any inertial frame is c in all 
directions. 

Postulate 3. The geometry of space is Euclidean in any inertial frame. 

Let the reference frame S comprise rectangular Cartesian axes O xyz. We shall 
assume that the coordinates of a point relative to this frame are measured by the 
usual procedure and employing a measuring scale which is stationary in S (it is 
necessary to state this precaution, since it will be shown later that the length of a 
bar is not independent of its motion). It will also be supposed that standard 
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atomic clocks, stationary relative to S, are distributed throughout space and are 
all synchronized with a master-clock at O. A satisfactory synchronization 
procedure would be as follows: Warn observers at all clocks that a light source at 
O will commence radiating at t = i 0 . When an observer at a point P first receives 
light from this source, he is to set the clock at P to read t 0 + OP/c, i.e. it is assumed 
that light travels with a speed c relative to S,as found by experiment. The position 
and time of an event can now be specified relative to S by four coordinates 
(x, y, z, t), t being the time shown on the clock which is contiguous to the event. 
We shall often refer to the four numbers (x, y, z, t) as an eieni. 

Let Oxy z be rectangular Cartesian axes determining the frame S (to be precise, 
these are rectangular as seen by an observer stationary in S) and suppose that 
clocks at rest relative to this frame are synchronized with a master at O. Any event 
can now be fixed relative to S by four coordinates (x, y, z, 7), the space coordinates 
being measured by scales which are at rest in S and the time coordinate by the 
contiguous clock at rest in S. If (x, y, z, /), (x, y, z, 7) relate to the same event, in 
this section we are concerned to find the equations relating these corresponding 
coordinates. It is helpful to think of these transformation equations as a 
dictionary which enables us to translate a statement relating to any set of events 
from the S-language to the S-language (or vice versa). 

The possibility that the length of a scale and the rate of a clock might be affected 
by uniform motion relative to a reference frame was ignored in early physical 
theories. Velocity measurements were agreed to be dependent upon the reference 
frame, but lengths and time measurements were thought to be absolute. In 
relativity theory, as will appear, very few quantities are absolute, i.e. are 
independent of the frame in which the measuring instruments are at rest. 

To comply with Postulate 1, we shall assume that each of the coordinates 
(x, y, z, 7 ) is a linear function of the coordinates (x, y, z, t). The inverse relation- 
ship is then of the same type. A particle moving uniformly in S with velocity 
(t\, i r v z ) will have space coordinates (x, y, z) such that 

x = x 0 + c x f, y = y 0 + c f t, z = z 0 + v z i (4.1) 

If linear expressions in the coordinates (x, y, z, 7) are now substituted for 
(x, y, z, r), it will be found on solving for (x, y, z,) that these quantities are linear in 
/ and hence that the particle's motion is uniform relative to S. In fact, it may be 
proved that only a linear transformation can satisfy the Postulate 1. 

Now suppose that at the instant t = i 0 a light source situated at the point P 0 
(x 0 , y 0 , z 0 ) in S radiates a pulse of short duration. At any later instant t, the 
wavefront will occupy the sphere whose centre is P 0 and radius c(t - r 0 ). This has 
equation 

(x - x 0 ) 2 + (y - y 0 ) 2 + (z - z 0 ) 2 = c 2 (t - t 0 ) 2 (4.2) 

Let (Xo, yo. ^o) be the coordinates of the light source as observed from S at the 
instant 7 = 7 0 the short pulse is radiated. At any later instant 7, in accordance with 
Postulate 2, the wavefront must also appear from S to occupy a sphere of radius 
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eft -t 0 ) and centre (x 0 , y 0 , z 0 ). This has equation 

(x -x 0 ) 2 + (y-y 0 ) 2 + (z -z 0 ) 2 = c 2 (r -f 0 ) 2 (4.3) 

Equations (4.2), (4.3) describe the same set of events in languages appropriate to S, 
S respectively. It follows that the equations relating the coordinates (x, y, z, r ), 
(x, y, 2 , t) must be so chosen that, upon substitution for the 'barred' quantities 
appearing in equation (4.3) the appropriate linear expressions in the 'unbarred' 
quantities, equation (4.2) results. 

A mathematical device due to Minkowski will now be employed. We shall 
replace the time coordinate / of any event observed in S by a purely imaginary 
coordinate x 4 = id (i = J — 1). The space coordinates (x, y, z) of the event will 
be replaced by (x ( , x 2 , x 2 ) so that 

x = x,, y = x 2 , z = x 3 , ict = x 4 (4.4) 

and any event is then determined by four coordinates x, ( i = 1, 2, 3, 4) A similar 
transformation to coordinates x, will be carried out in S. Equations (4.2), (4.3) can 
then be written 

X (Xj - x l0 ) 2 = 0 

» = I 

X (x - *.o ) 2 = 0 

> = i 

The x, are to be linear functions of the x, and such as to transform equation (4.6) 
into equation (4.5) and hence such that 

4 4 

Z (x. - x,o ) 2 Z (*i - x.o ) 2 < 4 - 7 ) 

i = 1 t = ! 

k can only depend upon the relative velocity of S and S. It is reasonable to assume 
that the relationship between the two frames is a reciprocal one, so that, when the 
inverse transformation is made from S to S, then 

4 4 

X (X; -x.o) 2 -* k X (x, -x l0 ) 2 (4.8) 

1=1 1=1 

But the transformation followed by its inverse must leave any function of the 
coordinates x, unaltered and hence k 2 = 1. In the limit. as the relative motion of S 
and S is reduced to zero, it is clear that k -» + 1. Hence k ± - 1 and we conclude 
that k is identically unity. 

The x, will now be interpreted as rectangular Cartesian coordinates in a four- 
dimensional Euclidean space which we shall refer to as (? 4 . This space is termed 
Minkowski space-time. The left-hand member of equation (4.5) is then the square 
of the 'distance' between two points having coordinates x„ x l0 . It is now clear that 
the x, can be interpreted as the coordinates of the point x, referred to some other 
rectangular Cartesian axes in S 4 . For such an interpretation will certainly enable 


(45) 

(4.6) 
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us to satisfy the requirement (4.7) (with k = 1 ). Also, the x„ x, will then be related 
by equations of the form 

4 

x, = Y, a ij x i + (4.9) 

j = * 

where i = 1, 2, 3, 4 and the h, are constants and this relationship is linear. The 
b, are the coordinates of the origin of the first set of rectangularaxes relative to the 
second set. The will be shown to satisfy certain identities in Chapter 2 
(equations (8.14), (8.15)). It is proved in algebra texts that the relationship 
between the x, and x, must be of the form we are assuming, if it is (i) linear and (ii) 
such as to satisfy the requirement (4.7). 

Changing back from the x,, x, to the original coordinates of an event by 
equations (4.4), the equations (4.9) provide a means of relating space and time 
measurements in S with the corresponding measurements in S. Subject to certain 
provisos (e.g. an event which has real coordinates in S, must have real coordinates 
in S), this transformation will be referred to as the general Loreniz transformation. 

5. The special Lorentz transformation 

We shall now investigate the special Lorentz transformation obtained by 
supposing that the x,-axes in <? 4 are obtained from the x r axes by a rotation 
through an angle a parallel to the x, x„-plane. The origin and the x 2 , x 3 -axes are 
unaffected by the rotation and it will be clear after consideration of Fig. 1 
therefore that 

x ! — x j cos x x 4 sin x x 2 = x 2 
x 4 = -x, sina +x 4 cos« x 3 = x 3 



(5.1) 


Fig 1 
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Employing equations (4.4), these transformation equations may be written 

x = xcosa + iff sins y = y) 

itY = -xsinx + ic/cosa z=z J (5.2) 

To interpret the equations (5.2), consider a plane which is stationary relative to 
the S frame and has equation 

ax + by + cz + d = 0 (5.3) 

for all t. Its equation relative to the S frame will be 

(acosa)x + by + cz + d + icrasinx = 0 (5.4) 

at any fixed instant t. In particular, if a = b = d = 0, this is the coordinate plane 

Oxy and its equation relative to S is z — 0, i.e. it is the plane Oxy. Again, if b = c 

= d = 0, the plane is Oyz and its equation in S is 

x = -icftanx (5.5) 

i.e. it is a plane parallel to Oyz displaced a distance - ici tan x along Ox. Finally, if 
a = c = d = 0, the plane is Ozx and its equation with respect to S is y = 0, i.e. it is 
the plane Ozx. We conclude, therefore, that the Lorentz transformation 
equations (5.2) correspond to the particular case when the coordinate planes 
comprising S are obtained from those comprising S at any instant t by a 
translation along Ox a distance -icf tana (Fig. 2). Thus, if u is the speed of 
translation of S relative to S, 

u = -if tana (5.6) 

It should also be noted that the events 

x = y = z = t = 0, x = y = 2 = 7 = 0 


y y 



Fig. 2 
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correspond and hence that, at the instant O and O coincide, the Sand S clocks at 
these points are supposed set to have zero readings; all other clocks are then 
synchronized with these 

Equation (5.6) indicates that a is imaginary and is directly related to the speed 
of translation. We have tana = iu/c and hence 

1 (iu/c) 

cosoc = — j- = — s--, sina = — 7 — r- (5.7) 

7 d -uV) 70- u V) 

Substituting in the equations (5.2), the special Lorentz transformation is obtained 
in its final form, viz. 

x = /?(x-ut) y = yl 

T = /?(r - ux/c 2 ) z=zj (5.8) 


where = (1 -u 2 /c 2 )~ 1 ,2 . 

If u is small by comparison with c, as is generally the case, these equations may 
evidently be approximated by the equations 


x = x — ut y = y 
T= t Z — Z 


(5.9) 


This set of equations, called the special Galilean transformation equations, is, of 
course, the set which was assumed to relate space and time measurements in the 
two frames in classical physical theory. However, the equation T = t was rarely 
stated explicitly, since it was taken as self-evident that time measurements were 
absolute, i.e. quite independent of the observer. It appears from equations (5.8) 
that this view of the nature of time can no longer be maintained and that, in fact, 
time and space measurements are related, as is shown by the dependence off upon 
both t and x. This revolutionary idea is also suggested by the manner in which the 
special Lorentz transformation has been derived, viz. by a rotation of axes in a 
manifold which has both spacelike and timelike characteristics. However, this 
does not imply that space and time are now to be regarded as basically similar 
physical quantities, for it has only been possible to place the time coordinate on 
the same footing as the space coordinates in <? 4 by multiplying the former by i. 
Since x 4 must always be imaginary, whereas x,, x 2 , x 3 are real, the fundamentally 
different nature of space and time measurements is still maintained in the new 
theory. 

If u > c, both x and t as given by equations (5.8) are imaginary. We conclude 
that no observer can possess a velocity greater than that of light relative to any 
other observer. 

If equations (5.8) are solved for (x, y, z, t) in terms of (x, y, z, f ), it will be found 
that the inverse transformation is identical with the original transformation, 
except that the sign of u is reversed. This also follows from the fact that the inverse 
transformation corresponds to a rotation of axes through an angle -a in 
space -time. Thus, the frame S has velocity — u when observed from S. 
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6. Fitzgerald contraction. Time dilation 

In the next two sections, we shall explore some of the more elementary physical 
consequences of the transformation equations (5.8). 

Consider first a rigid rod stationary in S and lying along the x-axis. Let x 
= x,. x = x 2 at the two ends of the bar so that its length as measured in S is given 
by 

1 = x 2 x, (6.1) 

In the frame S, the bar is moving with speed u and, to measure its length, it is 

necessary to observe the positions of its two ends at the same instant t. Suppose 
chalk marks are made on the x-axis at x = x t , x = x 2 , opposite the two ends, at 
the instant /. The making of these marks constitutes a pair of events with 
space-time coordinates (x,, t), (x 2 , i) in S. In S, this pair of events must have 
coordinates (x,,f|), (x 2 , i 2 ). Equations (5.8) now require that 

x, = /J(x, -ut), x 2 = IUx 2 -ut ) (6.2) 

But x 2 -x, — l is the length of the bar as measured in S and it follows by 
subtraction of equations (6.2) that 

/-r v /(i-«V) (6 3) 

The length of a bar accordingly suffers contraction when it is moved longitudin- 
ally relative to an inertial frame. This is the Fitzgerald contraction. 

This contraction is not to be thought of as the physical reaction of the rod to its 
motion and as belonging to the same category of physical effects as the 
contraction of a metal rod when it is cooled. It is due to a changed relationship 
between the rod and the instruments measuring its length. I is a measurement 
carried out by scales which are stationary relative to the bar, whereas / is the result 
of a measuring operation with scales which are moving with respect to the bar. 
Also, the first operation can be carried out without the assistance of a clock, but 
the second operation involves simultaneous observation of the two ends of the 
bar and hence clocks must be employed. In classical physics, it was assumed that 
these two measurement procedures would yield the same result, since it was 
supposed that a rigid bar possessed intrinsically an attribute called its length and 
that this could in no way be alfected by the procedure employed to measure it. It is 
now understood that length, like every other physical quantity, is defined by the 
procedure employed for its measurement and that it possesses no meaning apart 
from being the result of this procedure. From this point of view, it is not 
surprising that, when the procedure must be altered to suit the circumstances, the 
result will also be changed. It may assist the reader to adopt the modern view of 
the Fitzgerald contraction if we remark that the length of the rod considered 
above can be altered at any instant by simply changing our minds and 
commencing to employ the S frame rather than the S frame. Clearly, such a 
change of mathematical description can have no physical consequences. 
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Consider again the two events when chalk marks are made on the x-axis. 
Applying equations (5.8) to the space-time coordinates of the events in the two 
frames, the following equations are obtained: 

T\ — P{i ~ux,/c 2 ), T 2 = P(t -ux 2 /c 2 ) (6.4) 

These equations show that 7, ± f 2 ;i.e. although the events are simultaneous in S, 
they are not simultaneous in S. The concept of simultaneity is accordingly also a 
relative one and has no absolute meaning as was previously thought. 

The registration by the clock moving with O of the times T , , t 2 constitutes two 
events having coordinates (0, 0, 0, 7j ), (0,0, 0, t 2 ) respectively in S. Employing the 
inverse transformation to (5.8), it follows that the times / i 2 of these events as 
measured in S are given by 

h=Ph (6.5) 

and hence that 

7, -T 2 = (i, -t 2 ) J(\ -u 2 /c 2 ) (6.6) 

This equation shows that the clock moving with O will appear from S to have its 
rate reduced by a factor ^(1 -u 2 /c 2 ). This is the time dilation effect. 

Since any physical process can be employetf as a clock, the result just obtained 
implies that all physical processes will evolve more slowly when observed from a 
frame relative to which they are moving. Thus, the rate of decay of radioactive 
particles present in cosmic rays and moving with high velocities relative to the 
earth has been observed to be reduced by exactly the factor predicted by equation 
( 6 . 6 ). 

It may also be deduced that, if a human passenger were to be launched from the 
earth in a rocket which attained a speed approaching that of light and after 
proceeding to a great distance returned to the earth with the same high speed, 
suitable observations made from the earth would indicate that all physical 
processes occurring within the rocket, including the metabolic and physiological 
processes taking place inside the passenger’s body, would suffer a retardation. 
Since all physical processes would be affected equally, the passenger would be 
unaware of this effect. Nonetheless, upon return to the earth he would find that 
his estimate of the duration of the flight was less than the terrestrial estimate. It 
may be objected that the passenger is entitled to regard himself as having been at 
rest and the earth as having suffered the displacement and therefore that the 
terrestrial estimate should be less than his own. This is the clock paradox. The 
paradox is resolved by observing that a frame moving with the rocket is subject to 
an acceleration relative to an inertial frame and consequently cannot be treated as 
inertial. The results of special relativity only apply to inertial frames and the 
rocket passenger is accordingly not entitled to make use of them in his own frame. 
As will be shown later, the methods of general relativity theory are applicable in 
any frame and it may be proved that, if the passenger employs these methods, his 
calculations will yield results in agreement with those obtained by the terrestrial 
observer. 
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Another clock paradox which requires more thought to resolve, can be stated 
thus: The clock at O runs slow when compared with O. But the frame S may be 
taken as the rest frame and then a similar argument proves that the moving clock 
O goes slow when compared with O This is a contradiction. Only inertial frames 
are involved, so that the paradox cannot be disposed of by rejecting one of the two 
calculations. Instead, it must be appreciated that a direct comparison of two 
clocks at different points in space cannot be made; the statement ‘O runs slow by 
comparison with O' needs amplification. The meaning special relativity theory 
gives to this sentence is: O is found to run slow when it is compared with the 
successive synchronized clocks, belonging to the frame S, with which it coincides 
during its motion. Similarly, ‘O runs slow compared with O’ must be expanded to 
'O runs slow when compared with the successive synchronized clocks belonging 
to S with which it coincides during its motion'. There is no contradiction between 
these expanded statements (see Exercise 19 at the end of this chapter). 

7. Spacelike and timelike intervals. Light cone 

We have proved in section 4 that if x„ x [0 are the coordinates in Minkowski 
space- time of two events, then 

X ( 71 ) 

i = i 

is invariant, i.e. has the same value for all observers employing inertial frames and 
thus rectangular axes in space-time. Reverting by equations (4.4) to the ordinary 
space and time coordinates employed in an inertial frame, it follows that 

(x - x 0 ) 2 + (>■ - y 0 ) 2 + (z - z 0 ) 2 - c 2 (t - t 0 ) 2 (7.2) 

is invariant for all inertial observers. 

Thus, if (x, y, z, t), (x 0 , y 0 , z 0 , r 0 ) are the coordinates of two events relative to 
any inertial frame S and we define the proper lime interval z between the events by 
the equation 

r 2 = (t -t 0 ) 2 - (x -x 0 ) 2 + (> -> 0 ) 2 + (z z 0 ) 2 } (7.3) 

then z is an invariant for the two events. Two observers employing different 
inertial frames may attribute different coordinates to the events, but they will be 
in agreement concerning the value of t. 

Denoting the time interval between the events by At and the distance between 
them by Ad, both relative to the same frame S and positive, it follows from 
equation (7.3) that 

t 2 = At 2 —-^Ad 1 (7.4) 

c 

Suppose that a new inertial frame S is now defined, moving in the direction of the 
line joining the events with speed Ad/ At. This will only be possible if Ad/ At < c. 
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Relative to this frame the events will occur at the same point and hence Ad = 0. 
By equation (7.4), therefore, 

r 2 = A/ 2 (7.5) 

i.e. the proper time interval between two events is the ordinary time interval 
measured in a frame (if such exists) in which the events occur at the same space 
point. In this case, it is clear that r 2 > 0 and the proper time interval between the 
events is said to be timelike. 

Suppose, if possible, that a frame S can be chosen relative to which the events 
are simultaneous. In this frame At = 0 and 

t 2 = — , A d 2 (7.6) 

c 

Thus r 2 < 0, and, in any frame, Ad /At > c. r is then purely imaginary and the 
interval between the events is said to be spacelike. 

If the interval is timelike, Ad/At < c and it is possible for a material body to 
be present at both events. On the other hand, if the interval is spacelike. Ad/ At > c 
and it is not possible for such a body to be present at both events. The 
intermediate case is when Ad /At - c and then z = 0. Only a light pulse can be 
present at both events. It also follows that the proper time interval between the 
transmission and reception of a light signal is zero. 

We shall now represent the event (x, y, z, t) by a point having these coordinates 
in a four-dimensional space. This space is also often referred to as Minkowski 
space- time but, unlike the space-time continuum introduced in section 4, it is not 
Euclidean. However, this representation has the advantage that the coordinates 
all take real values and it is therefore more satisfactory when diagrams are to be 
drawn. Suppose a particle is at the origin O of S at t =0 and commences to move 
along Ox with constant speed u. Its y- and z-coordinates will always be zero and 
the representation of its motion in space-time will be confined to the xt-plane. In 
this plane, its motion will appear as the straight line QP, Q being the point x — y 
= z = / = 0 (Fig. 3). QP is called the world-line of the particle. If L PQr = 0, 
tan# = u. But |uj $ c and hence the world-line of the particle must lie in the 
sector AQB, where L AQB = 2a and tan a = c. Similarly, the world-line of a 
particle which arrives at O at t = 0 after moving along Ox, must lie in the sector 
A'QB'. It follows that any event in either of these sectors must be separated from 
the event Q by a timelike interval, since a particle can be present at both events. 
Events in the sectors AQB', A'QB are separated from Q by spacelike intervals, 
since it is impossible for a particle to be present at such an event and also at Q. 
A'A, B'Bare the world-lines of light signals passing through Oat t = Oand being 
propagated in the directions of the positive and negative x-axis respectively. 

For any event in AQB, t > 0, i.e. it is in the future with respect to the event Q 
when the frame S is being employed. However, by no choice of frame can it be 
made simultaneous with Q, for this would imply a spacelike interval. A fortiori, in 
no frame can it occur prior to Q. The sector AQB accordingly contains events 
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Fig. 3 

which are in the absoluiejuture with respect to the event Q. Similarly, all events in 
the sector A' QB' are in the absolute past with respect to Q. On the other hand, 
events lying in the sectors AQB', A'QB are separated from Q by spacelike 
intervals and can all be made simultaneous with Q by proper choice of inertial 
frame. These events may occur before or after Q depending upon the frame being 
used. These two sectors define a region of space-time which will be termed the 
conditional present. 

Since no physical signal can have a speed greater than c, the world-line of any 
such signal emanating from Q must lie in the sector AQB. It follows that the event 
Q can be the physical cause of only those events which are in the absolute future 
with respect to Q. Similarly, Q can be the effect of only those events in its absolute 
past. Q cannot be casually related to events in its conditional present. 

This state of affairs should be contrasted with the essentially simpler situation 
of classical physics where there is no upper limit to the signal velocity and AA', 
BB' coincide along the x-axis. Past and future are then separated by a perfectly 
precise present in which events all have the time coordinate t = 0 for all observers. 

In the four-dimensional space Qxyzr, the three regions of absolute past, 
absolute future and conditional present are separated from one another by the 
hyper-cone 

x 2 + y 2 + z 2 — c 2 t 2 = 0 (7.7) 

A light pulse transmitted from Q will have its world-line on this surface, which is 
accordingly called the light cone at Q. Since any arbitrary event can be selected to 
be Q, any event is the apex of a light cone which separates the space-time 
continuum in an absolute manner into three distinct regions relative to the event. 
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Exercises 1 

1. A particle of mass m is moving in the plane of axes Oxy under the action of a 
force f, Oxy is an inertial frame. Ox' y' is rotating relative to the inertial frame so 
that L x'Ox = i/'and iji = cj. (r, 0)are polar coordinates of the particle relative to 
the rotating frame. If (/ r ,/ fl ) are the polar components of f, (a,, a 6 ) are the polar 
components of the particle’s acceleration relative to Ox'y', v is the particle’s speed 
relative to this frame and 4> is the angle its direction of motion makes with the 
radius vector in this frame, obtain the equations of motion in the form 

ma r = f r + Imcjvsin 4> + mrw 1 
ma e = f e — 2mawcos <j> — mrcb 

Deduce that the motion relative to the rotating frame is in accordance with the 
second law if, in addition to f, the following forces are also taken to act on the 
particle: (i) mw 2 r radially outwards (the centrifugal force), (ii) 2 mcov at right 
angles to the direction of motion (the Coriolis force), (iii) mr<b transversely. (The 
latter force vanishes if the rotation is uniform.) 

2. A bar lies along Ox and is stationary in S. Show that if the positions of its 
ends are observed in S at instants which are simultaneous in S, its length 
deduced from these observations will be greater than its length in S by a factor 
(1 -u J /c 2 )~ 1 ,J . 

3. Suppose that the bar referred to in Exercise 2 takes a time T to pass a fixed 
point on the x-axis, T being measured by a clock stationary at the fixed point. 
Defining the length of the bar in the S-frame to be uT, deduce the Fitzgerald 
contraction. 

4. The measuring rod employed by S will appear from S to be shortened by a 
factor (1 - u 2 /c 2 ) 1 ' 2 . Hence, when S measures the length of the bar fixed in S he 
might be expected to obtain the result 

/ = 77(1 -u 2 /c 2 ) 1 ' 2 

This contradicts equation (6.3), Resolve the contradiction. (Hint: It will be 
observed from S that S fixes the position of the forward end of the bar first and the 
position of the rear end a time ul/c 2 later.) 

5. A bar lies stationary along the x-axis of S Show that the world-lines of the 
particles of the bar occupy a certain ‘band’ in the x,x 4 -plane. By measuring the 
width of this ‘band’ parallel to the x,-axis, deduce the Fitzgerald contraction. 

6. Verify that the Lorentz transformation equations (5.8) can be expressed in 
the form 


x = xcosh a -asinh a y = y 
cT = ctcosh a - xsinh a z=z 
where tanhx = u/c. Deduce that 

x - cT = (x — cOe", x + cT = (x + ct)e~ , ‘ 
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Hence show that x 2 —c 2 t 2 is invariant under this transformation. The clocks 
moving with the S-frame are observed from the S-frame at the time r. Certain of 
them are seen to indicate the same time t. Show that these will lie in a plane relative 
to S and that this plane moves in S with velocity ctanhja. 

7. T wo light pulses are moving in the positive direction along the x-axis of the 
frame S, the distance between them being d. Show that, as measured in S, the 
distance between the pulses is 


d 


c + u 


8. A and B are two points of an inertial frame S a distance d apart. An event 
occurs at B a time T (relative to clocks in S) after another event occurs at A. 
Relative to another inertial frame S, the events are simultaneous. If AP is a 
displacement vector in S representing the velocity of S relative to S , prove that P 
lies in a plane perpendicular to AB, distance c 2 T/d from A. 

9. S, S are the inertial frames considered in section 5. The length of a moving 
rod, which remains parallel to the x and x axes, is measured as a in the frame Sand 
a in the frame S. By consideration of a Minkowski diagram for the rod, or 
otherwise, show that the rest length of the rod is 

aapu/c 

^(Ijiad-a 2 - a 2 ) 

10. If the position vectors r = (x,}\ 2 ),r = (x,y, 2 ) of an event as determined by 
the observers in the parallel inertial frames S, S respectively are mapped in the 
same independent <? 3 , prove that 

f = r + u l)+0t j 

T = fi{i + u-r/c 2 ) 

where u is the velocity of S as measured from S. 

1 1. A car 5 m long is to be placed in a garage only 3 m long. It is driven into the 
garage at four-fifths the speed of light c m/s by the owner; just before the bumpers 
strike the wall (which withstands the impact), show that the owner’s wife can slam 
the doors. Calculate the length of the garage as seen by the driver and prove that 
he estimates that the car strikes the wall 4/c seconds before the doors are closed. 
Hence explain how the car fits into the garage from his point of view. 

12. S, S are the two inertial frames related by the special Lorentz equations and 
u is the velocity of S relative to S. At r = Oin S, particles A and Bare at the points 
(0, 0, 0) and ( d , 0, 0) respectively. Thereafter, both particles move along the x-axis 
with speed v a constant distance d apart. Write down equations describing the 
motion of the particles and, by transforming these to the frame S show that, in this 
frame, the particles are observed to move with a speed 

v — u 
1 - uv/c 2 



19 


a distance 

d{ 1 -u 2 /c 2 )' 2 
1 - uv/c 2 

apart. 

13. S and S are the usual inertial frames having relative velocity u. A point 
moves along the x-axis with constant acceleration c/z starting from rest at O at 
f = 0. Write down its equation of motion in the frame Oxyz. If u = c/ <J2, show 
that its motion in the frame Oxy 2 is determined by the equation 

x 2 -2 c[t + r)x + 2c 2 i(i + z) = 0 

Deduce that, if t is small compared with r, then 


x = 72 (I + !/4t) 

14 Two men are stationary in the S-frame at points on the x-axis separated by 
a distance d. They tire light pulses at one another simultaneously. In the S-frame, 
show that one man A fires a time jiud/c 2 before the other man Band that, at the 
instant B fires, /fs missile is still approaching B and is distant 


\ c +• u ) 


from him. 

1 5. S and S are the usual pair of inertial frames having relative velocity u. The 
xz-plane of S is the surface of a lake. Waves are being propagated over this surface 
in the direction of the x-axis and are described by the equation y = asin2nf{t 
- x/w'), /being the frequency and w the wave velocity. Obtain the equation which 
describes this wave motion in S and deduce that the frequency and wave velocity 
in this frame are given by 


f =f Pi . 1 - “/*'). vv' = — J. 

1 — uw/c 

16. S, S are inertial frames. When observed from S, two events are simul- 
taneous and at a distance D apart. When observed from S, the time interval 
between the events is T. Calculate the distance between the events when observed 
from S and if, when viewed from S, the direction of motion of S relative to S 
makes an angle 0 with the line joining the events, show that the relative speed of 
the frames is 

D 2 

c(! +^iyr cos20 > ‘ 2 - 


17. Observed from a frame S, events A and B lie on the x-axis and B occurs a 
time T after A; the distance between the events is D. Calculate the velocity u of the 
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frame S relative to S if, observed from S, the event B occurs a time T before A. 
What is the distance between the events as observed from S? (Assume D > cT.) 
(Ans. u = 2 c 2 DT/(D 2 + c 2 T 2 ), D = D.) 

18. In the frame S, a particle is projected from O at r = 0 with the velocity 
components (jc, jc, 0)and thereafter moves so that its acceleration is constant of 
magnitude g and is directed along the y-axis in the negative sense. Write down the 
xand y coordinates of the particle a time t later. The particle’s motion is observed 
from the S frame. If u = jc, using the transformation equations (5.8), obtain 
equations for its coordinates (x, y) at time T in this frame. Deduce that the angle 
made by the velocity of projection with the x-axis is 60° and that the particle’s 
acceleration is 48c//25. 

19. The clocks at the origins of the frames S, S are synchronized to read zero 
when they pass one another and the clocks stationary in either frame are 
synchronized with the clock at the origin of the frame. At time t in S, the clock at O 
passes a clock C fixed to the x-axis of S. Show that C registers t and O registers 
^(1 -u 2 ,'c 2 )t at this instant. Observed from S, clock C is slowed by a factor 
^/(l -u 2 /c 2 ); in this frame, therefore, it might be expected that when C registers /, 
O would register t/ yj( 1 - u 2 /e 2 ). Resolve the paradox by showing that, in the S 
frame, the clock C is not synchronized with O, but that C is always u 2 t/c 2 ahead of 
O. 


20. In the frame S, at t = 1, a particle leaves the origin O and moves with 
constant velocity in the xy-plane having components v x = 5c /6, v y — c/3. What 
are the coordinates (x, y) of the particle at any later time t? If the velocity of 
relative to S is u = 3c/5, calculate the coordinates (x, y ) of the particle at time T in 
and deduce that the particle makes its closest approach to (5 at time T = 220/1 1 3. 

21. S, S are the usual parallel frames with the origin O of S moving along the x- 
axis of S with velocity u. An observer A is stationed on the x-axis at x = a and an 
observer A is stationed on the x-axis at x = a. Show that, in both the frames, the 
events (i) O passes O and (ii) A passes A, are separated by a time T, but that the 
order of occurrence of these events is different. Calculate the value of T. If T 
= a/3c, show that u = 3c/5. 


coj c/)j 



CHAPTER 2 


Orthogonal Transformations. Cartesian 
Tensors 


8. Orthogonal transformations 

In section 4 events have been represented by points in a space <? 4 . The resulting 
distribution of points was described in terms of their coordinates relative to a set 
of rectangular Cartesian axes. Each such set of axes was shown to correspond to 
an observer employing a rectangular Cartesian inertial frame in ordinary <? 3 - 
space and clocks which are stationary in this frame. In this representation, the 
descriptions of physical phenomena given by two such inertial observers are 
related by a transformation in from one set of rectangular axes to another. 
Such a transformation has been given at equation (4.9) and is called an orthogonal 
transformation. In general, if x h x,(i = 1, 2, ... , N) are two sets of N quantities 
which are related by a linear transformation 

/v 

*i= Z a.jXj + h, (8.1) 

j - i 

and, if the coefficients a of this transformation are such that 

Z (*i - Ti) 2 = Z (*. ~ >',) 2 (8.2) 

i = 1 i = I 

is an identity for all corresponding sets x,-, x f and >•„ y h then the transformation is 
said to be orthogonal. It is clear that the x* x, may be thought of as the 
coordinates of a point in S N referred to two different sets of rectangular Cartesian 
axes and then equation (8.2) states that the square of the distance between two 
points is an invariant, independent of the Cartesian frame. 

Writing z { = x, — z ; = x { — y„ it follows from equation (8.1) that 

z. = Z a,j z ) (8.3) 

j ~ 1 

Let z denote the column matrix with elements z„z the column matrix with 
elements z ( and A the N x N matrix with elements a ty Then the set of equations 

21 
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(8.3) is equivalent to the matrix equation 


tNJ 

II 

IN 

(8.4) 

Also, if z is the transpose of r. 


tM_ 

M 

II 

N 

-• K» 

(8.5) 

and thus the identity (8.2) may be written 


z'z = z z 

(8.6) 

But, from equation (8.4), 


N i 

II 

(8.7) 

Substituting in the left-hand member of equation (8.6) from equations (8.4), (8.7), 
it will be found that 

z A' Az = z z 

(8.8) 

This can only be true for all z if 


A' A = / 

(8.9) 

where / is the unit N x N matrix. 

Taking determinants of both members of the matrix equation (8.9), we find that 
\A\ 2 = 1 and hence 

II 

1+ 

(8.10) 

A is accordingly regular. Let A 1 be its inverse. Multiplication on 
A ~ 1 of both members of equation (8.9) then yields 

the right by 

A’ = A~ 1 

(8.11) 

It now follows that 


AA’ = AA~ 1 = 1 

(8.12) 

Let d ti be the yth element of /, so that 



(8.13) 


The symbols <5, ; are referred to as the Kronecker deltas. Equations (8.9), (8. 1 2) are 
now seen to be equivalent to 

X «.>«« = £>* 18 . 14 ) 

i = 1 
N 

X Ojitin = <5j k 

i = 1 


( 8 . 15 ) 
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respectively. These conditions are necessarily satisfied by t he coefficients a it of the 
transformation (8. 1 ) if it is orthogonal. Conversely, if either of these conditions is 
satisfied, it is easy to prove that equation (8.6) follows and hence that the 
transformation is orthogonal. 

9. Repeated-index summation convention 

At this point it is convenient to introduce a notation which will greatly abbreviate 
future manipulative work. It will be understood that, wherever in any term of an 
expression a literal index occurs twice, this term is to be summed over all possible 
values of the index. For example, we shall abbreviate by writing 

N 

£ a r b, = a r b r (9.1) 

r - I 

The index must be a literal one and we shall further stipulate that it must be a 
small letter. Thus a s b N are individual terms of the expression a,b r , and no 
summation is intended in these cases. 

Employing this convention, equations (8.14) and (8.15) can be written 

a ik = <5,k, a p a kl = 6 jk (9.2) 

respectively. Again, with z, = x, - y,, equation (8.2) may be written 

ZjZ, = :,z, (9.3) 

More than one index may be repeated in the same term, in which case more 
than one summation is intended. Thus 

,V .V 

a ijbjk<-k = £ X fl i jbjk c k (9.4) 

j = I k = l 

It is permissible to replace a repeated index by any other small letter, provided the 
replacement index does not occur elsewhere in the same term. Thus 

a,b, = dpi = a k b k (9.5) 

but 

a,ja lk ± a n a jk (9.6) 

irrespective of whether the right-hand member is summed with respect to j or not. 
A repeated index shares this property with the variable of integration in a definite 
integral. Thus 

j/(x)dx = j/(>')dy (9.7) 

a a 

A repeated index is accordingly referred to as a dummy index. Any other index will 
be called a free index. The same free index must appear in every term of an 
equation, but a dummy index may only appear in a single term. 
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The reader should note carefully the identity 

8„a J = a, (9.8) 

for it will be of frequent application. 8^ is often called a substitution operator, 
since when it multiplies a symbol such as a ; , its effect is to replace the index j by i. 

10. Rectangular Cartesian tensors 

Let x ; , y, be rectangular Cartesian coordinates of two points Q, P respectively in 
Writing 2 , = x, - y h the are termed the components of the displacement 
vector PQ relative to the axes being used. If x it y, are the coordinates of Q, P with 
respect to another set of rectangular axes, the new coordinates will be related to 
the old by the transformation equations (8.1). Then, if z, are the components of 
PQ in the new frame, it follows (equation (8.3)) that 

z, = a^Zj (10.1) 

Any physical or geometrical quantity A having N components A, defined in the 
x-frame and N components A, similarly defined in the x-frame, the two sets of 
components being related in the same manner as the components z„ z, of a 
displacement vector, i.e such that 

A, = a i,A, (10.2) 

is said to be a vector in& N relative to rectangular Cartesian reference frames. We 
shall frequently abbreviate 'the vector whose components are A,' to 'the vector 
A,'. 

If A,, Bj are two vectors, consider the N 2 quantities A,B r Upon transformation 
of axes, these quantities transform thus: 

AiB, = a ik aj,A k B, (10.3) 

Any quantity having N 2 components C,; defined in the x-frame and N 2 
components C,y defined similarly in the x-frame, the two sets of components 
being related by a transformation equation 

C,j = a ik a^C kl (10.4) 

is said to be a tensor of the second rank. We shall speak of ‘the tensor C,f. Such a 
tensor is not, necessarily, representable as the product of two vectors. 

A set of N 3 quantities D ijk which transform in the same manner as the product 
of three vectors A, B } C k , form a tensor of the third rank. The transformation law is 

D, jk = (10.5) 

The generalization to a tensor of any rank should now be obvious. Vectors are, 
of course, tensors of the first rank. 

If A u , B {J are tensors, the sums A^A- B tJ are N 2 quantities which transform 
according to the same lawas the A jy and B tJ . The sum of two tensors of the second 
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rank is accordingly also a tensor of this rank. This result can be generalized 
immediately to the sum of any two tensors of identical rank. Similarly, the 
difference of two tensors of the same rank is also a tensor. 

Our method of introducing a tensor implies that the product of any number of 
vectors is a tensor. Quite generally, if A tj , are tensors of any ranks 

(which may be different), then the product A t t B k , is a tensor whose rank is 
the sum of the ranks of the two factors. The reader should prove this formally for 
a product such as A tj B klm , by writing down the transformation equations. (N.B. 
the indices in the two factors must be kept distinct, for otherwise a summation is 
implied and this complicates matters; see section 12.) 

The components of a tensor may be chosen arbitrarily relative to any one set of 
axes. The components of the tensor relative to any other set are then fixed by the 
transformation equations. Consider the tensor of the second rank whose 
components relative to the x,-axes are the Kronecker deltas In the x r frame, 
the components are 

= u ik a lt S kl = a lk a jk = <5„ (10.6) 

by equations (9.2). Thus this tensor has the same components relative to all sets of 
axes. It is termed the fundamental tensor of the second rank. 

If, to take the particular case of a third rank tensor as an example, 

A ijk = A iik (10.7) 

for all values of i, j, k , A iJk is said to be symmetric with respect to its indices i,j. 
Symmetry may be with respect to any pair of indices. If A ijk is a tensor, its 
property of symmetry with respect to two indices is preserved upon transform- 
ation, for 

Aju c = OjiOi„a k „ Aj m „ 

= a im a jl a knA mt „ 

= A ijk (10.8) 

where, in the second line, we have rearranged and put A, m „ = A m ,„. Unless a 
property is preserved upon transformation, it will be of little importance to us, for 
we shall later employ tensors to express relationships which are valid for all 
observers and a chance relationship, true in one frame alone, will be of no 
fundamental significance. 

Similarly, if 

A ijk =-A jik (10.9) 

for all values of i, j, k, A uk is said to be skew-symmetric or anti-symmetric with 
respect to its first two indices. This property also is preserved upon transform- 
ation. Since A ilk = - A ilk , A llk = 0. All components of A ijk with the first two 
indices the same are clearly zero. 

A tensor whose components are all zero in one frame, has zero components in 
every frame. A corollary to this result is that if A tj . . . , . . are two tensors of 
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the same rank whose corresponding components are equal in one frame, then 
they are equal in every frame. This follows because A it — B, ; is a tensor 
whose components are all zero in the first frame and hence in every frame. Thus, a 
tensor equation 

A,,.. = (10.10) 

is valid for all choices of axes. 

This explains the importance of tensors for our purpose. By expressing a 
physical law as a tensor equation in <^ 4 , we shall guarantee its covariance with 
respect to a change of inertial frame. A further advantage is that such an 
expression of the law also implies that it is covarianl under a rotation and a 
translation of axes in<£ 3 , thus ensuring that the law conforms to the principles of 
isotropy and homogeneity of space. 

The first principle states that all directions in space are equivalent in regard to 
the formulation of fundamental physical laws. Examples are that the inertia of a 
body in classical mechanics is independent of its direction of motion and that the 
power of attraction of an electric charge is the same in all directions. In the vicinity 
of the earth, the presence of the gravitational field tends to cloud our perception 
of the validity of this principle and the vertical direction at any point on the 
surface is sharply distinguished from any horizontal direction But this is a purely 
local feature and the crew of a spaceship have no difficulty in accepting the 
principle. Mathematically, the principle requires that the equation expressing a 
basic physical law must not change its form when the reference frame is rotated. 
Laplace’s equation V 2 V = 0 is well known to possess this property, whereas the 
equation cV/cx + cV/cy + cVjcz = 0 does not; this explains why Laplace's 
equation occurs so frequently in mathematical physics, whereas the other 
equation does not. 

The second principle affirms that all regions in space are also equivalent, i.e. 
that physical laws are the same in all parts of the cosmos. Covariance under a 
translation of axes is the mathematical expression of this requirement. 

Both principles are almost certainly a consequence of the uniformity with 
which matter and radiation are distributed over the universe. It is doubtful 
whether either would be valid in a cosmos not possessing this property. 

A less well-established third principle is that of spatial parity. This requires that 
physical laws should be impartial as between left- and right-handedness. The 
mathematical formulation of a law obeying this principle will be covariant under 
a transformation from a right-handed to a left-handed Cartesian frame or vice 
versa. Another way of expressing this principle is that, if the universe were 
observed in a mirror, the laws which would appear to govern its behaviour would 
be identical with the actual laws. For example, observation of the planetary 
motions in a mirror would alter their senses of rotation about the sun, but the law 
of gravitation would be unaffected. Although the more familiar laws are in 
conformity with this principle, those governing the behaviour of some funda- 
mental particles do not appear to have this simple symmetry. Provided that the 
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orthogonal transformations upon which our tensor calculus is being built are not 
restricted to be such that \a\ = +1 (i.e. transformations between frames of 
opposite handedness are permitted), all tensor (and pseudotensor) equations will 
be in conformity with the principle of spatial parity. 

From what has just been said, it is evident that the calculus of tensors is the 
natural language of mathematical physics, relativistic or non-relativistic. It 
guarantees that the equations being considered are of the type which can 
represent physical laws. However, in classical physics, a three-dimensional theory 
was adequate to ensure conformity with the principles of isotropy, homogeneity 
and parity of space. In relativistic physics, a four-dimensional theory is needed to 
incorporate the additional special principle of relativity. 


II. Invariants. Gradients. Derivatives of tensors 

Suppose that Fisa quantity which is unaffected by any change of axes. Then Fis 
called a scalar invariant or simply an invariant. Its transformation equation is 
simply 

F = V (11.1) 

As will be proved later (section 24), the charge of an electron is independent of the 
inertial frame from which it is measured and is, therefore, the type of quantity we 
are considering. 

If a value of Fis associated with each point of a region of fi s , an invariant field 
is defined over this region. In this case F will be a function of the coordinates Xj. 
Upon transformation to new axes, F will be expressed in terms of the new 
coordinates x,; when so expressed, it is denoted by V. Thus 

F (x,, x 2 , . . . , x N ) = F (x , , x 2 , . . . , x N ) (1 1.2) 

is an identity. The reader should, perhaps, be warned that Fis not, necessarily, the 
same function of the x< that F is of the x,. 

If Aj j is a tensor, it is obvious that V A is also a tensor of the second rank. It is 
therefore convenient to regard an invariant as a tensor of zero rank. 

Consider the N partial derivatives cF/cx ( . These transform as a vector. To 
prove this it will be necessary to examine the transformation inverse to (8.1). In 
the matrix notation of section 8, this may be written 

x = A ~ 1 (x -b) = A' (x — b) (11.3) 

having made use of equation (8.11), Equation (11.3) is equivalent to 

*, = a'ifixj-bj) (11.4) 

where a',, is the i/th element of A'. But a-j = a ti and hence 

(Xj-bj) 


(11.5) 
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It now follows that 


and hence that 


c V cV cXj G V 

Dx i cXjCXj '’dxj 


( 11 . 6 ) 


(11-7) 


proving that GV /ox, is a vector. It is called the gradient of V and is denoted by 
grad V or V V. 

If a tensor A tj is defined at every point of some region of the result is a 
tensor field. The partial derivatives 8A,j Jcx, can now be formed and constitute 
a tensor whose rank is one greater than that of A ( j . We shall prove this for a 
second rank tensor field A it . The argument is easily made general. We have 


by equation (1 1.6). 


d ~ = 

CX k OX k 

5 .c 

T {tii f Qj 5 A rs )~ 

ex, c. 

Gx, 


cx, 
cx k 


(H.8) 


12. Contraction. Scalar product. Divergeuce 

If two indices are made identical, a summation is implied. Thus, consider A iik . 
Then 


Aijj — <4i| I + -4i22 + • ■ ■ + ^i\.v 


( 12 . 1 ) 


There are N 3 quantities A tjk . However, of the indices in A, J)y only i remains free to 
range over the integers 1,2, ... , N,and hence there are but N quantities A i)t and 
we could put B, = A, h . The rank has been reduced by two and the process is 
accordingly referred to as contraction. 

Contraction of a tensor yields another tensor. For example, if B, = A :jj then, 
employing equations (9.2), 

B, ®iqttj r aj$Aq rs a,qd rs Aq r J a,qAq rr a,qBq (12.2) 

Thus B, is a vector. The argument is easily generalized. 

In the special case of a tensor of rank two, e.g. A ij7 it follows that A u = i.e. 
A,, is an invariant. Now, if A,, B, are vectors, ,4,6, is a tensor. Hence, A^B, is an 
invariant. This contracted product is called the inner product or the scalar product 
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of the two vectors. We shall write 

AjB, = A - B (12.3) 

In particular, the scalar product of a vector with itself is an invariant. The positive 
square root of this invariant will be called the magnitude of the vector. Thus, if A is 
the magnitude of A„ then 

A 2 = A,A i = AA = A 2 (12.4) 

In # 3 , if 0 is the angle between two vectors A and B, then 

ABcos 0 = A ■ B (12.5) 

In <S N , this equation is used to define 6. Hence, if 

A- B = 0 (12.6) 

then 0 = j7t and the vectors A, B are said to be orthogonal. 

If A, is a vector field, oAJox j is a tensor. By contraction it follows that cAjcx , is 
an invariant. This invariant is called the divergence of A and is denoted by div A. 
Thus 


.. * SA ‘ 
div A = - — 
ox, 


(12.7) 


More generally, if A i} . is a tensor field, cA,, Jcx, is a tensor. This tensor 
derivative can now be contracted with respect to the index r and any other index 
to yield another tensor, e.g. 5A, ; JSxj. This contraction is also referred to as the 
divergence of A,y with respect to the index j and we shall write 


13. Pseudotensors 


? A ij . . 

CXj 


diV;A,y . . 


( 12 . 8 ) 


is a pseudotensor if, when the coordinates are subjected to the transformation 
(8.1), its components transform according to the law 

N,,-= (13.1) 

| A | being the determinant of the transformation matrix A. Since for orthogonal 
transformations | A | = +1 (equation (8.10)), relative to rectangular Cartesian 
frames, tensors and pseudotensors are identical except that, for certain changes of 
axes, all the components of a pseudotensor will be reversed in sign. For example, 
if in <?3 a change is made from the right-handed system of axes to a left-handed 
system, the determinant of the transformation will be - 1 and the components of 
a pseudotensor will then be subject to this additional sign change. 

Let e, ; „ be a pscudotensor of the Nth rank which is skew-symmetric with 

respect to every pair of indices. Then all its components are zero, except those for 
which the indices /, j, . . . , n are all different and form a permutation of the 
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numbers 1,2, . . . , N. The effect of transposing any pair of indices in e tJ „isto 
change its sign. It follows that if the arrangement i,j, . . . , n can be obtained from 
1, 2, .... N by an even number of transpositions, then e u „ = + e l2 N , 
whereas if it can be obtained by an odd number „ = - e 12 N . Relative to 
the x,-axes let e 12 N = 1. Then, in this frame, e j; „ is 0 if i,j, . . . , n is not a 
permutation of 1 , 2, . . . , N, is +1 if it is an even permutation and is - 1 if it is an 
odd permutation. Transforming to the x^axes, we find that 

^12 ... A/ = M \ a li a 2 j ■ • • fl A In *ij . . it = \ A | — 1 (1 3.2) 

But e i; „ is also skew-symmetric with respect to all its indices, since this is a 
property preserved by the transformation. Its components are also 0, + 1 
therefore and e I2 „ is a pseudotensor with the same components in all frames. 
It is called the Levi-Civita pseudotensor. 

It may be shown without difficulty that: 

(i) the sum or difference of two pseudotensors of the same rank is a 
pseudotensor. 

(ii) the product of a tensor and a pseudotensor is a pseudotensor. 

(iii) the product of two pseudotensors is a tensor. 

(iv) the partial derivative of a pseudotensor with respect to x, is a pseudotensor. 

(v) a contracted pseudotensor is a pseudotensor. 

Thus, to prove (iii), let '21,, 58* be two pseudovectors. Then 

% = \A [' 2 a ih a fl % S, = o lk a Jt % », (13.3) 

The method is clearly quite general. The remaining results will be left as exercises 
for the reader. 

14. Vector products. Curl 

Throughout this section we shall be assuming that N = 3, i.e. the space will be 
ordinary Euclidean space. 

Let At , B, be two vectors. Then e iJk A,B s is a pseudotensor of rank 5. 
Contracting twice, we get the pseudovector 

G, = e iJk A j B k (14.1) 

whose components are 

= *4 2 B$ <4382 ") 

<S 2 = A i B l -A t B 3 V (14.2) 

Gj - A 1 8 2 — A 2 B j J 

Provided we employ only right-handed systems of axes or only left-handed 
systems in & 3 , G, is indistinguishable from a vector. If, however, a change is made 
from a left-handed system to a right-handed system, or vice versa, the 
components of G< are multiplied by — 1 in addition to the usual vector 
transformation. Since it is usual to employ only right-handed frames, G* is often 
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referred to as a vector (or an axial vector) and treated as such. It is then called the 
vector product of A and B and we write 

(E=AxB (14.3) 

Vector multiplication is non-commutative, for 

B x A = e i , k B j A k = - c, ki A k B J = - A x B (14.4) 

having made use of e itj = - t ijk . However, vector multiplication obeys the 
distributive law, for 

A x (B + C) = e ijk Aj{B k + C k ) = e ijk AjB k + t ijk AjC k 

= AxB+AxC (14.5) 

We now introduce the abbreviated notation dAJcx , = A, . Any index after a 
comma will hereafter indicate a partial differentiation with respect to the 
corresponding coordinate; thus. A, jk is a second derivative. 

Now suppose Aj is a vector field. We can first construct a pseudotensor of rank 
5 e ijk A r s . Contracting twice, we get the pseudovector 

= t iik A k , j (14.6) 

This has components 

51 = 

1 ~ 5x 2 cx 3 

^ cA, cA 2 

2 dx } cx, 

_ &A i 

3 dx, Sx 2 1 

and is denoted by curl A. It, also, is an axial vector. 

Equation (14.1) can still be employed to define a vector product when either or 
both of the vectors A, Bare replaced by pseudotensors. If only one is replaced by a 
pseudovector the right-hand member of equation (14.1) will involve the product 
of two pseudovectors and a vector. The resulting vector product will then be a 
vector. Similarly, by replacing A in equation (1 4.6) by a pseudovector, the curl of a 
pseudovector is defined as an ordinary vector. 

Exercises 2 

1. Show that, in two dimensions, the general orthogonal transformation has 
matrix A given by 

cos 6 sin 9\ 

- sin 9 cos 9 I 
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Verify that | A | = 1 and that A 1 = A'. T if is a tensor in this space. Write down in 
full the transformation equations for all its components and deduce that 7], is an 
invariant. 

2. x = Ax, x = Bx are two successive orthogonal transformations relative to 
each of which T tj transforms as a tensor. Show that the resultant transformation x 
= BAx is orthogonal and that T tJ transforms as a tensor with respect to it. 

3. If A ; , B, are vectors and X tJ A,B l is an invariant, prove that X (J is a tensor. 

4. Verify that the transformation 

x, = ^(5x, - 14x 2 + 2x 3 ) 
x 2 = — ^( 2 x, + x 2 +2x 3 ) 

Xj = |^(1 Ox, + 2x 2 - llx 3 ) 

is orthogonal. A vector field is defined in the x-frame by the equations A t = x\, 
A 2 = xj, A 3 = X 3 . Calculate the field in the x-frame and verify that divA is an 
invariant. 

5. A ijk is a tensor, all of whose components are zero, except for the following: 
Am = A 222 = 1 ,A 212 = — 2. Calculate the components of the vector A ijt . Show 
that the transformation 

x, = 4(-3x, - 6x 2 -2x 3 ) 
x 2 = t(-2x, +3x 2 - 6 x 3 ) 
x 3 =y( 6 x, -2x 2 -3x } ) 

is orthogonal and calculate the component A , 23 of the tensor in x-frame. Write 
down the equations of the inverse transformation. If B tj is a tensor whose 
components in the x-frame all vanish except that B l} = 1, calculate B )2 . 
(Ans. (- 1, 1, 0); 120/343; 6/49.) 

6 . If A = (/ — B)(l + B )~ ', where B is a skew-symmetric matrix, show that A is 
orthogonal. Taking 

0 2 2 
B = -2 0 0 
-2 0 0 

calculate A and write down the rectangular Cartesian coordinate transformation 
equations x = Ax. In the x-frame, the tensor C u is skew-symmetric and C l2 
= C ,3 = 1 , C 2 3 = 0. Calculate the component C , 2 in the x-frame. In the x-frame, 
all the components of the tensor D iJk vanish except the following Z>, 2 , = - 1, 
D 122 =*■ 2, £> 123 = 5. Calculate the component £>, , , in the x-frame. Calculate the 
componentsof the vectors £),yyand C u D iJk in the x-frame. (Ans. C, 2 = = 

-980/729; (-14/9, -8/9, -8/9); (35/9, -32/9, -7/9).) 
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7. A,, is a tensor field defined in the x-frame by the equation A,j = x,x r 
Calculate its components at the point P where x, =0, x 2 = x i = 1. The 
coordinates x, of a point in the x-frame are related to the coordinates x ; of the 
same point in the x-frame by the equations 

x, = 2 (-3x, - 6 x 2 — 2x 3 ) 
x 2 = 7 ( — 2x, + 3 x 2 — 6 x 3 ) 

S 3 = -2x 2 - 3x 3 ) 

Calculate the component A , , of the tensor field at P. In the x-frame, prove that 
Ajj,j = 4Xj, Aifijj = 12. (Ans. A u = 64/49.) 

8 . x is the position vector of a point P with respect to an origin O. OP is rotated 
through an angle 8 about an axis whose direction is determined by the unit vector 
u. If the new position vector of P is x, prove that 

x = xcos 0 + (1 - cos d) (x-u)u + u x x sin 0 

Deduce that the coordinate transformation generated when a rectangular 
Cartesian frame Ox,x 2 x 3 is rotated through an acute angle sin 1 (4/5) about an 
axis through O having direction ratios (1, 2, 2) to give a new frame Ox, x 2 x 3 is 

45x, = 29x, +28x 2 - 20x 3 
45x 2 = -20x, + 35x 2 +20x 3 
45x 3 = 28 x, — 4x 2 + 35 x 3 

9. Verify that the transformation 

x, = g (x, -8x 2 + 4x 3 ) 
x 2 = g (4x, + 4 x 2 + 7 X 3 ) 
x 3 = g( 8 x, x 2 — 4x 3 ) 

is orthogonal. In the x-frame, the tensor A tJ is skew-symmetric and A, 2 = -4 , 3 
= 1, < 42 3 = 0- Calculate the component -4 , 2 in the x-frame. In the x-frame, all the 
components of the tensor B ijk vanish except the following: B , 21 = — 1,B , 22 = 2, 
®i 2 3 = 5. Calculate the component B m in the x-frame. Calculate the com- 
ponents of the vectors B t)J and AijB iJk in the x-frame. (Ans. A , 2 = 1/3; B, n 
= 188/729; 2/9, - 16/9, 8/9; 47/27, 11/27, - 10/27.) 

10. In the x-frame in <f 3 a tensor field is defined by the equation A iJk = x , 2 
+ 2xf + x k . Calculate the divergence of the vector field A lJr Also, calculate the 
curl of the vector field A ijt . (Ans. 1 6 (x , -t-x 2 +x 3 ); 6 (x 2 — x 3 ), etc.) 

11. A pair of rectangular Cartesian frames are related by the equations 

x, =rs(5x, — 14 x 2 + 2x 3 ) 
x 2 = - j( 2 x, +x 2 +2x 3 ) 

* 3 = rs( 10 x, + 2 x 2 - llx 3 ) 
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A ljk is a tensor, all of whose components vanish in the x-frame except the 
following: 4, , , = 4 222 = 2, 4, 22 = 4, 4 233 = 1 3 Calculate (a) the components 
of the vector A in in the x-frame. and (b) 4 123 . If B tJ is a tensor whose components 
in the x-frame all vanish except B 12 , B 23 , which are both unity, calculate B n . 
If Kisan invariant field given in the x-frame by V = xj, calculate the field in the x- 
frame and the components of grad V in this frame, where x, = x 2 = x 3 = 9. (Ans. 
(a) (- 12, -9, 6); (b) - 1396/225; B, , = -2/3; VK= (2, -4,4).) 

12. IntheframeOx,x 2 x 3 ,allcomponentsof 4 i; are zero except 4 )2 = - A 2 , 
= 1. If the transformation equations to the x-frame are 

x | = x, cos x + x 2 sin x 
x 2 = — x, sin x + x 2 cos x 

*3 = *3 

prove that A l2 = 1 

13. If A, j = x? +Xj (i.j = 1 , 2 , 3), prove that (a) A irj = 2 (x t + x 2 +x 3 -t-xj 

(bl 4 (J „, 12. 

14. Verify that the coordinate transformation 

25x, = 9x, + 20 x 2 + 12x 3 
25 x 2 = 12x, - 15 x 2 4- 16x 3 
5x 3 = - 4x, + 3 x 3 

is orthogonal and calculate the component A in of the tensor A lJk in the x-frame 
if all its components in the x-frame vanish except for the following: 4, 23 = 25, 
4 222 = - 6. All components of the tensor B h in the x-frame are zero, except for 
the components fl l3 (i = 1, 2, 3). Show that, in the x-frame, all the components 
B, 2 (i = 1. 2, 3) vanish. (Ans. — 192 / 25 .) 

15. (i) x is a column matrix such that x'x = 1 . If A = I — 2xx’, where l is a unit 
matrix, prove that A is orthogonal. If x' = x(l, -2,3) where x is a scalar 
multiplier, calculate x and hence find 4. Written in matrix form, the coordinate 
transformation between two rectangular Cartesian frames is x = 4x (4 is the 
matrix just calculated). In the x-frame, all the components of the tensor B iJkl are 
zero, except that B, 133 = - 20, B, 232 = 29. Calculate the component B 222 1 in 
the x-frame. What is the value of If is a pseudotensor with components 
(2, 3, 6) in the x-frame, find its components in the x-frame. (Ans. B 222 , = 72/49; 
B ih ,= - 20 ; ( 0 , - 7 , 0 ).) 

(ii) 4, 2 (x,, x 2 , x 3 ) is a tensor field. If 4,, = f>,;-x 2 , where <5,y is the Kronecker 
delta, list the non-zero components of cA lj /dx k . Deduce the value of c4, , /t x, at 
the point where x, = 1, x 2 = 2, x 3 = 1. (Take the frames to be related as in (i).) 
(Ans. c4 M /t?x, = 4/7.) 

16. Verify that the transformation 

7x, = 3x, + 6x 2 — 2x 3 
7x 2 = 2x, — 3x 2 — 6x 3 
7x 3 = 6x, — 2x 2 + 3x 3 
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is orthogonal and calculate the component A 32 of the tensor A u in the x-frame 
if all its components in the x-frame are zero except for the following: A, , = 10, 
A 2 2 = 29. If the only non-zero component of the tensor B ijk in the x-frame is 
B 2 32 = 343, calculate the component B, n in the x-frame. If £ i2 is a pseudotensor 
and £ 33 = 98, all other components being zero, calculate E, 2 . (Ans. A 32 = 6 ; 
B 111 = 24; (T 12 = —24.) 

17. If A is a square skew-symmetric matrix, show that A 2 is symmetric. If, also, 
A 1 — - A, show that the matrix B given by B = I + 2 A 2 is orthogonal. If 

I 0 a h\ 

A = l-u 0 c J 

'-b — c 0/ 

prove that A 2 = - A provided a 2 + b 2 +c 2 = 1. Taking a = 1/3, b = c = 2/3, 
calculate the orthogonal matrix B. Written in matrix form, the coordinate 
transformation between two rectangular cartesian frames is x = Bx. In the x- 
frame, the tensor C has all its components equal to 1. Calculate the component 
C 23 in the x-frame. In the x-frame, a vector field A, has components A, = x 2 +x 2 
+ X 3 , A 2 = A } = 0. Obtain formulae for the field’s components in the x-frame. 
Calculate the divergence of the field in both frames and show that the results are 
equal. (Ans. B 2i =91/81.) 

18. (i) The equations 

x, = !(ax, + 8 x 2 +4x 3 ) 
x 2 = 5 < 4 x, + bx 2 +7x 3 ) 
x 3 = £( 8 x, + x 2 +cx 3 ) 

represent a transformation between rectangular Cartesian axes. Calculate the 
values of a, b , c. In the x-frame, all the components of the tensor A tj are zero 
except A 23 = 9. Calculate the component in the x-frame. 

(Ans. a = 1, b = —4, c = —4; A 3 , = 4/9.) 

(ii) If A ijkl is a tensor, prove that A iiki is also a tensor. In the x-frame referred to 
in (i), all the components of the tensor A ijk , vanish except for the following: A, , 3 , 
= 4 - A 3 1 33 = 5, A 2222 = 18. Calculate all the components of the tensor A ijki in 
the x-frame and the (1, l)-component in the x-frame. (Ans. 44/3). 

19. Verify that the transformation 

*1 = rs(9x, + 20 x 2 + 12x 3 ) 
x 2 = 23(12x! — 15x 2 + I 6 X 3 ) 

*3 = s(4x, -3xj) 

is orthogonal and write down the inverse transformation. In the x-frame, all the 
components of the tensor A ijk vanish except A , , , = 1. Calculate the component 
A 1 23 in the x-frame. In the x-frame, all the components of the tensor B ijk vanish 
except that B 2 , 2 = 1, 63,3 = fl 222 = B I2I =2. Calculate the components of the 
vector B, 2i in the x-frame. (Ans. A 123 = 432/3125; (3,0,4).) 
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20. Show that, for all angles a, ft, the transformation 

x, = x, cos x cos /? + x 2 cos x sin /( - x 3 sin x 

x 2 = -x, sin/f + x 2 cos/i 

x 3 = x, sin x cos /? + x 2 sin x sin p + x 3 cos x 

is orthogonal. Obtain the form taken by the transformation when x = [i = n/4 
and. in this case, calculate the component A , 123 of the tensor A jjkl in the x-frame, 
if the only non-zero components in the x-frame are A t , 23 = /t 2213 = 1- (Ans. 0.) 

21. If A, B are orthogonal matrices of the same order, prove that AB is 
orthogonal. If 


I 3 

4 

°\ 

/ 3 

0 

4 \ 

-4 

3 

° • 

B = i 0 

5 

° 

' 0 

0 

5/ 

— 4 

0 

3/ 


calculate the orthogonal matrix C = AB. The coordinates x, and x, in two 
rectangular Cartesian frames are related by the transformation x = Cx. All the 
components of the tensor A, jk vanish in the x-frame except A 22l = 1,A 122 = 25. 
Calculate the component A 32 , in the x-frame. In the x-frame, the only non-zero 
components of the tensor B ijkl are B 32n = -5, B 3212 = 10, S 32 , 3 = 15. 
Calculate the components of the vector A ijk B l]kt in the x-frame and deduce its 
components in the x-frame. (Ans. J 321 = — 48/5; 4464/25, -96/5, -48/25.) 

22. If 91, 2 is a pseudotensor, 'B, is a pseudovector and E t is a vector, prove that 

is an invariant. 

23. A vector field has components in the x-frame given by 

<4, = sin(x 2 x 3 ). A 2 = cos(x 3 Xj), A 3 = tan(x,x 2 ) 

Calculate curl A at the point x, = x 2 = x 3 = jy/x. If the x-frame is obtained 
from the x-frame by reversing the sense of the x 2 -axis, what are the components 
of curl A in the x-frame? (Ans. y/n(l + 1/2 ^2), - ,Jn(\ -1/2 J2), -^(2n); 
- 1 -t-1/2 v '2), - VJt(l - 1 , 2y2), 

24. x,, x, (i = 1, 2, 3) are coordinates of the same point with respect to two 
different rectangular Cartesian frames. If 

x, = x(ux, -f 2x 2 + 5x 3 ) 
x 2 = /i(x , + hx 2 + 2x 3 ) 
x 3 = y(2x, - 1 lx 2 +ex 3 ) 

where x, /?, y are all positive, calculate the values of a, /?, y, a , b, c. If A iJk , is a tensor 
whose only non-zero components in the x-frame are /4 I313 = 1, 4 3223 = 20, 
calculate its component A 3121 in the x-frame. In the x-frame, the only non-zero 
components of the tensors A t , B tj are as follows: A, = 3, A 2 = I, B 13 = 5, B 22 
= 3. Calculate the components of the tensor AjB^ in the x-frame. (Ans. a =- — 14, 
b = 2, c = 10, a = 1/15, 0 = 1/3, y = 1/15; J 312l = 2/5; (3, -9, 12).) 

25. A is an anti-symmetric matrix such that A 3 = - A. Prove that the matrix B 
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= / + A + A 2 is orthogonal. Show that the matrix 


A = J 



satisfies the stated conditions and hence calculate the orthogonal matrix B. The 
coordinates of a point relative to a pair of rectangular Cartesian frames are 
related by the matrix transformation x = Bx. A tensor A u has all its components 
zero in the x-frame except that A i2 = 81. Calculate the component A 12 in the x- 
frame. If the tensor B ijk has all its components zero in the x-frame except B , 23 
= 729, calculate the component B 32I in the x-frame. (Ans. A i2 = 32; 
fl J2 , = -128.) 

26. In S 3 , prove that 

curl grad V = 0, div curl A = 0. 

27. In prove that 


(*) *1*1 *im« — ~ 

(**) e iU e .km = 


28. In $ 3 , show that 


V 2 V - div grad V = 


t 2 V d 2 V 6^V 
cxf + <5x 2 + 5x| 


c 2 V 

<3x,cx, 


29. In <? 3 , prove that 


curl curl A = grad div A — V 2 A 


(Hint: Employ Exercise 27(i),) 

30. In <? 3 , prove that 


(i) A x (B x C) = A • CB - A • BC 


(it) 


A BxC = 


A, 

A 2 

A, 


5. C, 
B z C 2 
B 3 C } 


31. In <S N , prove that 


divCA= CdivA + A- grad V 

32, In <f 3 , prove that 


(i) curl V A = V curl A - A x grad V 

(ii) div (A x B) = B-curl A - A -curl B 

(iii) curl(A xB)=B-VA-A-VB+AdivB-BdivA 

(iv) grad(A- B) = B- VA + A- VB+ A x curl B + B x curl A 
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where AVB=/l > B j> . 

33. If Ajj is a tensor and B tJ = A jh prove that B {j is a tensor. Deduce that if A {j 
is symmetric in one frame, it is so in all. 

34. Prove that <5, .*<5* = <5;* 

and that e iJk e lm „ has the value + 1 if i,j, k are all different and (Imn) is an even 
permutation of (ijk), - 1 if i,j, kart all different and {Imn) is an odd permutation of 
(ijk), and 0 otherwise. Deduce that 

jk ^il^jm^kn 4" ^int^jn^kl 4" ^in^jl^km 

-&in&jm&kl -&il & jn 6 km-&im&jl&kn 

Hence prove that 

e ,jk e ,m» = bjn.&kn-bj^km 

35. In (f 3 , prove that 

(i) (a x b)-(c x d) = acbd -adbc 

(ii) (a x b) x (c x d) = [acd]b — [bcd]a 

= [abd]c - [abc]d 

where [abc] = a • b x c. 



CHAPTER 3 


Special Relativity Mechanics 


15. The velocity vector 


Suppose that a point P is in motion relative to an inertial frame S. Let ds be the 
distance between successive positions of P which it occupies at times t, r+df 
respectively. Then, by equation (7.4), if dr is the proper time interval between 
these two events, 

dr = fdt 2 --j ds 2 ^ = ( 1 — u 2 /c 2 )' /2 dr (15.1) 


where i’ = ds/dt is the speed of P as measured in S. Now, as shown in section 7, dr 
is the time interval between the two events as measured in a frame for which the 
events occur at the same point. Thus dr is the time interval measured by a clock 
moving with P. dt is the time interval measured by clocks stationary in S. 
Equation (15.1) indicates that, as observed from S, the rate of the clock moving 
with P is slow by a factor (1 -t 2 /c 2 ) u2 . This is the phenomenon of time dilation 
already commented upon in section 6. If P leaves a point A at t = t, and arrives at 
a point B at t = t 2 , the time of transit as registered by a clock moving with P will 
be 


h 

-J 


(1 — tr/c 2 ) I,2 dr 


(15.2) 


The successive positions of P together with the times it occupies these positions 
constitute a series of events which will lie on the point’s world-line in Minkowski 
space time. Erecting rectangular axes in space-time corresponding to the 
rectangular Cartesian frame S, let x ( , x^-t-dx* be the coordinates of adjacent 
points on the world-line. These points will represent the events (x, y, z, t), (x + dx, 
>■ + dy, z +dz, t -t- dr) in S. If (r,, v y , c 2 )are the components of the velocity vector v 
of P relative to S, then 


i' 


X 


dx 

dr’ 


dy 
df ’ 




dz 

dt 


(15.3) 


v does not possess the transformation properties of a vector relative to 
orthogonal transformations (i.e. Lorentz transformations) in space -time. It is a 
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vector relative to rectangular axes stationary in S only. However, we can define a 
4 -velocity vector which does possess such properties as follows: dx, is a 
displacement vector relative to rectangular axes in space-time and dr is an 
invariant. It follows that dx./dr is a vector relative to Lorentz transformations 
expressed as orthogonal transformations in space-time. It is called the 4-velocity 
vector of P and will be denoted by V. 

V can be expressed in terms of v thus: 


dr 


dx, dr 
dt dr 


= (\-v 2 /c 2 ) 


,.2/„2y-|/2; 


(15.4) 


by equation (15.1). Also, from equations (4.4) we obtain 

x, = v x , x 2 = v y , x 3 = v x , x 4 = ic (15.5) 

It now follows from these equations that 

V = (1 - v 2 /c 2 )~ 1,2 (i x , v y , v x , ic) = (1 -v 2 /c 2 )~' 2 (v,ic) (15.6) 

where the notation should be clear without further explanation. 

Knowing the manner in which the components of V transform when new axes 
are chosen in space-time, equation (15.6) enables us to calculate how the 
components of v transform when S is replaced by a new inertial frame S. Thus, 
consider the orthogonal transformation (5.1) which has been interpreted as a 
change from an inertial frame S to another S related to the first as shown in Fig. 2. 
The corresponding transformation equations for V are 


(15.7) 


V, = F, cos a + F 4 sin a V 2 = V 2 

F 4 = — V, sin « + F 4 cos a F 3 = V 3 

By equation (15.6), these equations are equivalent to 

(1 ~v 2 /c 2 )~ l,2 v x = (1 —v 2 /c 2 )~ ll2 {v x cos a + ic sin a) 

(1 -v 2 /c 2 )~' l2 v > = (1 -v 2 /c 2 y' l2 v y 

(1 —v 2 /c 2 )-' i2 v 2 = (1 — v 2 /c 2 )~ i,2 v t 
(1 — v 2 /c 2 )~ 1/2 ic = (1 — v 2 /c 2 )~ l,2 ( — v x sin x + ic cos a) 

where v is the velocity of the point as measured in the frame S. Substituting for 
cos a, sin a from equations (5.7), equations (15.8) can be written 


(15.8) 


V* = Q(v, - u) 

V, = Q( 1 -u 2 /c 2 ) l,2 v y 

», = (?(!— u 2 /c 2 )' l2 v z 
1 = Q( 1 - uv x /c 2 ) 

e= r r 

[(1 — u 2 /c 2 )(l -u 2 /c 2 )J 


(15.9) 


where 


(15.10) 
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Dividing the first three equations (15.9) by the fourth, we obtain the special 
Lorentz transformation equations for v in their final form, viz. 


l * \-uvJc 2 

_ _ (1 — U 2 /C 2 )* ' 2 f y 
L * 1 — uvjc 2 

(1 -u 2 /c 2 )' 12 V, 

I, — = — - 

1 - uvjc 2 


(15.11) 


If u and t- are small by comparison with c, equations ( 1 5. 1 1 ) can be replaced by 
the approximate equations 

r* = v x - u, v t = v y , v z = v z (15.12) 

These are equivalent to the vector equation (1.1) relating velocity measurements 
in two inertial frames according to Newtonian mechanics. 

Since, by the fourth of equations (15.9), Q must be real, equation (15. 10) implies 
that if v < c then v < c. Thus, if a point is moving with a velocity approaching c in 
S and ? is moving relative to S with a velocity of the same order, the point’s 
velocity relative to S will still be less than c. Such a result is, of course, completely 
at variance with classical ideas. In particular, if a light pulse is being propagated 
along Ox so that v x = c, t v = c z = 0, then it will be found that i x = c, v y = v z = 0. 
This confirms that light is propagated with speed c in all inertial frames. 

The transformation inverse to (1 5,1 1 )can be found by exchanging ‘barred’ and 
unbarred’ velocity components and replacing u by — u. 

Suppose that particles A and B move along the x-axis of a frame S with speeds 
3c/4 in opposite directions, both leaving O at time t = 0. At any later time r, their 
x -coordinates will be x„ = -3cr/4, x 8 = 3cf/4 and their distance apart will be x B 
-x A = 3cr/2. Clearly, this distance increases at a rate 3c/2. However, this is not 
their relative velocity and the fact that the rate exceeds c does not conflict with the 
result already derived that no material body can be observed from any inertial 
frame to have a speed greater than c. There is no special-relativity prohibition 
against the distance between two bodies increasing at a rate greater thane. To find 
the velocity of B relative to A, it is necessary to introduce a second inertial frame S 
with its origin at A\ the velocity of B in S is then the velocity of B relative to A. 
Since the velocity of S relative to S is u = - 3c/4 and the velocity of B in S is v x 
= 3c/4, the first of equations (1 5.1 1) gives v x = 24c/25 ( < c) as the velocity of B 
relative to A. 


16. Mass and momentum 

In section 2 it was shown that Newton's laws of motion conform to the special 
principle of relativity. However, the argument involved classical ideas concerning 
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space-time relationships between two inertial frames and these have since been 
replaced by relationships based upon the Lorentz transformation. The whole 
question must therefore be re-examined and this we shall do in this and the 
following section. 

We shall begin by considering the conservation of momentum, equation (1.3), 
for the impact of two particles by which mass is defined in classical mechanics. 
Since the velocity vectors u,, etc. are not vectors relative to orthogonal 
transformations in space -time, and indeed transform between inertial frames in a 
very complex manner, it is at once evident that equation (1.3) is not covariant with 
respect to transformations between inertial frames. It will accordingly be 
replaced, tentatively, by another equation, viz. 


M,U, +M 2 L 2 = M,V, +M 2 V 2 (16.1) 

where U,, etc. are the 4-velocities of the particles and M t , M 2 are invariants 
associated with the particles which will correspond to their classical masses. This 
is a vector equation and hence is covariant with respect to orthogonal 
transformations in space-time as we require. Equation (16.1) will be abbreviated 
to the statement 


I M V is conserved (16.2) 

and then, by equation (15.6). this implies that 

I m(v, ic) is conserved ( 1 6.3) 

M 

where m = = — 5 m (16.4) 

d-tV )' 2 

By consideration of the first three (or space) components of (16.3), it will be clear 
that 

I m\ is conserved (16.5) 

and. by consideration of the fourth (or time) component that 

Zm is conserved (16.6) 

If, therefore, m is identified as the quantity which will play the role of the 
Newtonian mass in special relativity mechanics, our tentative conservation law 
(16.1) is seen to incorporate both the principles of conservation of momentum 
and of mass from Newtonian mechanics. The principle (16.1) is accordingly 
eminently reasonable. However, our ultimate justification for accepting it is, of 
course, that its consequences are verified experimentally. We shall refer to such 
checks at appropriate points in the later development. 

It appears from equation (16.4) that the mass of a particle must now be 
regarded as being dependent upon its speed 1 : If c = 0, then m = M. Thus M is the 
mass of the particle when measured in an inertial frame in which it is stationary. 
M will be referred to as the rest mass or proper mass and will, in future, be denoted 



43 


by m„ To distinguish it from m 0 , m is often called the inertial mass. Then 


m = — — 


(1 -rV)'- J 


(16.7) 


Clearly m-*iast'-> c, implying that inertia effects become increasingly serious 
as the velocity of light is approached and prevent this velocity being attained by 
any material particle. This is in agreement with our earlier observations. Formula 
(16.7) has been verified by observation of collisions between atomic nuclei and 
cosmic ray particles (e g., see F.xercise 27 at the end of this chapter). 

We shall define the 4 -momentum vector P of a particle whose rest mass is m 0 and 
whose 4-velocity is V. by the equation 

P = m 0 V (16.8) 

Since m 0 is an invariant and V is a vector in space-time, Pisa vector. By equation 
(15.6), 

P-m u (l -v 2 /c 2 )' 1 1 (v,i<) - (my,imc)= (p,imt) (16.9) 


where p = mv is the classical momentum. 

Relative to the special orthogonal transformation (5.1), the transformation 
equations for the components of P are 

P, = P, cos y. + P 4 sin a P 2 = P 2 

P 4 = - P, sin a + P 4 cos a P } = P } 


(16.10) 


Substituting for the components of P from equation (16.9) and similarly for P, 
and employing equations (5.7), it will be found that 


Px 

Py 


mu_ 

(1 - mV) 1 * 


Pz = Pz 


m = 


m- p x u/c 2 

(1 -u 2 /c 2 )' 2 


(16.11) 


(16.12) 


Equations (16.1 1) constitute the special Lorentz transformation equations for the 
components of the momentum p and equation (16.12) the corresponding 
transformation equation for mass. Since p x = mv x , this equation can also be 
written 


m 


1 - uvjc 
-u 2 /c 2 )' 12 


(16.13) 


This reduces to the classical form of equation (2.4) if u, c x are negligible by 
comparison with c. 
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17. The force vector. Energy 


We have seen that in classical mechanics, when the mass of a particle has been 
determined, the force acting upon it at any instant is specified by Newton's second 
law. Force receives a similar defintion in special relativity mechanics. The mass of 
a particle with a given velocity can be determined by permitting it to collide with a 
standard particle and applying the principle of momentum conservation. 
Equation (16.7) then gives its mass at any velocity. The force f acting upon a 
particle having mass m and velocity v relative to some inertial frame is then 
defined by the equation 



dp 

d; 


(17.1) 


where p is the particle’s momentum. Clearly f will be dependent upon the inertial 
frame employed, a departure from classical mechanics. 

Definition (17.1) implies that, if equal and opposite forces act upon two 
colliding particles, momentum is conserved. However, although experiment 
confirms that momentum is indeed conserved, Newton's third law cannot be 
incorporated in the new mechanics, for it will appear later that, if the forces are 
equal and opposite for one inertial observer, in general they are not so for all such 
observers. Equation (16.1) therefore replaces this law in the new mechanics. 

f is not a vector with respect to Lorentz transformations in space-time. 
However, a 4-force F can be defined which has this property. The natural 
definition is clearly 


F 


dP 

dr 



(17.2) 


P being the 4-momentum and t the proper time for the particle. F is immediately 
expressible in terms of f for, by equation (16.9), 



(p, imc ) 


d d i 

= - -(p, imc) — 
dt dr 


= (1 
= (1 


■u 2 /c 2 f 

■v 2 /c 2 )- 


1 2 (p, imc ) 
' 2 (lime) 


(17.3) 


The vectors V, F are orthogonal. This is proved as follows: From equation 
(15.6), 

V 2 - — c 2 


Differentiating with respect to r. 


V 


dV 

dr 


= 0 


(17.4) 
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i.e. 


V-F = 0 


(17.5) 


as stated. This result has very important consequences. Substituting for V and F 
from equations (15.6) and (17.3) respectively, it is clear that 


This is equivalent to 


( 1 — v 2 jc 2 ) 1 (v, ic) • (f, imc) = 0. 
v • f — c 2 m = 0. 


(17.6) 

(17.7) 


But, by definition, v • f is the rate at which f is doing work. It follows that the work 
done by the force acting on the particle during a time interval (r t , r 2 ) is 

'2 

j c 2 mdf = m 2 c 2 - m, c 2 (17.8) 

'i 

The classical equation of work is 

work done = increase in kinetic energy (17.9) 

where T = {mv 2 is the kinetic energy (KE). Equation (17.8) indicates that in 
special-relativity mechanics we must define T by a formula of the type 

T = me 2 + constant (17.10) 

When v = 0,7" = 0 and this determines the unknown constant to be -m 0 c 2 .Thus 


7" = 


m o c 


(1 -c 2 /c 2 ) 1 ' 2 


■ WnC‘ 


(17.11) 


If v/c is small (1 — v 2 /c 2 )~ 1 ' 2 = 1 +v 2 /2 c 2 approximately and the above equ- 
ation reduces to T = jm 0 v 2 , in agreement with classical theory. 

According to equation (17.10), any increase in the kinetic energy of a particle 
will result in a proportional increase in its mass. Thus, if a body is heated so that 
the thermal agitation of its molecules is increased, the masses of these particles, 
and hence the total body mass, will increase in proportion to the heat energy 
which has been communicated. 

Again, suppose two equal elastic particles approach one another along the 
same straight line with equal speeds v. If their rest masses are both m 0 , the net 
mass in the system before collision is 

2m 0 /(l -v 2 /c 2 ) 1 ' 2 

It has been accepted as a fundamental principle that this mass will be conserved 
during the collision. However, from considerations of symmetry, it is obvious that 
at some instant during the impact both particles will be brought to rest and their 
masses at this instant will be proper masses m' 0 . By our principle, 

-, , 2m 0 

0 - (1 -v 2 /c 2 ) 112 


(17.12) 
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It follows, therefore, that, at this instant, the rest mass of each particle has 
increased by 


m 0 

(1 -v 2 /c 2 )' :2 


-m 0 = T/c 2 


(17 13) 


where T is the original KE of the particle and use has been made of equation 
(17.1 1). Now, in losing this KE, the particle has had an equal amount of work 
done upon it by the force of interaction and this has resulted in a distortion of the 
elastic material of which it is made. At the instant each particle is brought to rest, 
this distortion is at a maximum and the elastic potential energy as measured by 
the work done will be exactly T. If we assume that this increase in the internal 
energy of the particle leads to a proportional increase in mass, the increment of 
rest mass (17.13) is explained. If the particles are not perfectly elastic, the work 
done in bringing them to rest will not only increase the internal elastic energy, but 
will also generate heat. Both forms of energy will then contribute to increase the 
rest masses. 

Such considerations as these suggest very strongly that mass and energy are 
equivalent, being two different measures of the same physical quantity. Thus, the 
distinction between mass and energy which was maintained in classical physical 
theories, has now been abandoned. All forms of energy £, mechanical, thermal, 
electromagnetic, are now taken to possess inertia of mass m, according to 
Einsteins equation , viz. 

E = me 2 (17.14) 


Conversely, any particle whose mass is m , has associated energy E and. by 
equation (17.1 1 ), 

E = T+m 0 c 2 (17.15) 

m 0 c 2 is interpreted as the internal energy of the particle when stationary. If the 
particle were converted completely into electromagnetic radiation, m g c 2 would 
be the energy released. This is the source of the energy released in an atomic 
explosion. The mass of the material products of the explosion is slightly less than 
the net mass present before the explosion, the difference being accounted for by 
the mass of the energy released. Even a small mass deficiency implies that 
an immense quantity of energy has been released. Thus, if m = 1 kg. i = 3 
x 10 8 ms“‘, then £ = 9 x 10' 6 J = 2.5 x 10 ,o kWh. 

The principle of conservation of mass, which has been incorporated into the 
new mechanics, is now seen to be identical with the principle of conservation of 
energy, which is accordingly also regarded as valid in the new mechanics. 
However, the distinction between the two principles, which was a feature of the 
older mechanics, has disappeared. 


18. Lorentz transformation equations for force 
By equation (17.7), 

ime = - f • v 

c 


(18.1) 
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Referring to equation (1 7.3), F can now be completely expressed in terms of f; thus 

F = (1 -v 2 /c 2 )~' :2 (18.2) 

Relative to the special Lorentz transformation, the transformation equations 
for the components of F are 


F, = F, cos a + F 4 sin a F 2 = F 2 1 
F 4 = - F, sin oc + F 4 cos a F 3 = F 3 J (18.3) 

Substituting from equation (18.2) into the first three of these equations and 
employing equations (5.7), it follows that 


l = q(l -~ 2 f-v) 

Ty=Q( 1 -u 2 /e 2 )' l2 f y ' 

Tz = Q( 1 -U 2 /C 2 ) l ‘ 2 f, 


(18.4) 


where Q is given by equation (15.10). Substituting for Q from the fourth of 
equations (15.9), it will be found that 


J ( U (fyVy+LVz) 
Jx Jx c 2 ' 1 -uvjc 2 
T (1 -U 2 /C 2 ) 1,2 

1 -uvjc 2 h 

T _(1 -u 2 /c 2 )‘ 2 r 
h ~ 1 -uvjc 2jx 


(18.5) 


These are the special Lorentz transformation equations for f. If u, v are negligible 
by comparison with c, these equations reduce to the classical form of equation 
( 2 . 6 ). 

It is clear from equations (18.5) that, if equal and opposite forces are observed 
from S to act upon two particles, the forces observed from S will not be so related 
unless the particles’ velocities are the same. 


19. Fundamental particles. Photon and neutrino 

By eliminating m and v between equations (16.7), (17.14) and the equation p = mv 
giving the linear momentum of a particle, it will be found that 

£ = c v /(P 2 +mlc 2 ) (19.1) 

This useful equation relates the total energy £ of a particle (including its internal 
energy) with its momentum p. A special case of great importance is when m 0 = 0, 
and then 


E = cp 


(19.2) 
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For such a particle, m = E/c 1 = p/c; i.e. its rest mass vanishes, but its inertial mass 
is non-zero. This result is inconsistent with equation (16.7), unless v = c (in which 
case the right-hand member becomes indeterminate). We conclude that any 
particle having zero rest mass must always move with the speed of light. 

Two such particles are known, the photon and the neutrino. The former is a 
quantum of electromagnetic energy and the latter is a particle which is generated 
in some interactions between fundamental particles governed by the weak 
interaction force (e.g. the decay of a neutron into a proton). Neither particle 
exhibits any electric charge. If either particle is absorbed by other matter, it loses 
its identity and delivers its energy and momentum to the absorbing body -the 
heating of a metal plate placed in sunlight is an example of the absorption of 
photons. Neutrinos are exceptionally difficult to absorb and hence to 
detect -there is a high probability that a neutrino from the sun will pass right 
through the earth without interaction with a single one of its atoms. 

As an example of a particle interaction in which a neutrino is involved, consider 
the decay of a negative pion (a meson ) into a muon (heavy electron) and a neutrino. 
Assuming that the pion is at rest in the laboratory frame, the momenta of the muon 
and neutrino will have equal magnitudes p, but will be in opposite senses. If m„, 
are the proper masses of the pion and muon respectively, the principle of 
conservation of energy leads to the equation 

m«c 2 = cp +c J{p 2 +mlc 2 ) (19.3) 

having used equations (19.1) and (19.2). Solving for p. we find 

P = c(ml-ml)l7m n (19.4) 

The energy of the muon is now found to be 

£„ = m,c 2 -cp = c 2 (ml + ml)/2m, (19.5) 

and the KE of this particle is accordingly 

T„=E„- m„c 2 = c 2 (m„ - m„) 2 /2m„ (19.6) 

From tables, we find that the rest masses in atomic energy units are m„c 2 = 140 
MeV, m„c 2 = 106 MeV (mega-electron volts). Hence, T„ = 4.1 MeV. This value 
has been checked experimentally by observing the length of the path of the muon 
in the resistive medium in which it is generated (usually liquid hydrogen). 


20. Lagrange’s and Hamilton’s equations 

Suppose that a particle having constant rest mass m 0 is in motion relative to an 
inertial frame under the action of a force derivable from a potential V. Then its 
equations of motion are 


d 

d7 


\ m 0 x ) 
{(1- v 2 /c 2 ) 112 j 


dV 
dx ’ 


etc. 


( 20 . 1 ) 
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Expressed in Lagrange form, these equations must be 


d(cL\ = <'L 
dt V cx ) ex ' 


and hence L must be a function of x, y, 2 , x, y, z, such that 


cL m n x cL cV 

— = r- rr ; — = , etc. 

cx (1 — k /c ) • cx cx 

Since v 2 = x 2 + y 2 + z J , these equations can be validated by taking 

L- -m 0 c 2 (l -v 2 /c 2 ) 112 - V 

which is accordingly the Lagrangian for the particle. 

Now 

cL 

— = P„ etc. 


and it follows exactly as in classical theory that, if the Hamiltonian H is defined by 
the equation 

H = Pxi'x + Py v y + Pz v z - L (20.6) 

and is then expressed as a function of the quantities x, y, z, p x , p y , p z alone, the 
Lagrange equations (20.2) are equivalent to Hamilton’s equations 


X ~ CPx' 


Px = -- 


p x V x + p y V f + P! V z - (] 

and hence 


+ V 

(1 - tV ) 1 ' 2 


the total energy, precisely as for classical theory. 


Px + Py+ Pz = 


1 — v 2 /c 2 


_ 2 _2 

2 2 1 

= " moC + r=7^ 

= - m^c 2 + E 2 /c 2 


(20.10) 
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and it follows that 

E 2 = c 2 (p 2 +pf. + p 2 + mfic 2 ) (20.11) 

Substituting in equation (20.9), 

H = c{p 2 + pi + p] + mlc 1 )' 2 + V (20.12) 

expressing Has a function of x, y, 2 , p x , p y , p.. The reader is now left to verify that 
Hamilton’s equations are equivalent to the equations of motion (20.1). 


21. Energy -momentum tensor 

Suppose that there is a continuous distribution of mass over some region of space. 
In this section, we shall suppose this to take any physical form whatsoever. For 
example, the distribution may be in the form of the molecules of an elastic body 
and, in this case, the mass must include a component corresponding to the mass of 
the potential energy of the field of stress, in addition to the inertia of the particles 
themselves. Such a field will be electromagnetic in nature, the electric charges 
present in the molecules being ultimately responsible for its presence; we shall 
not, therefore, exclude a further contribution to the mass-energy distribution 
from any other electromagnetic field which may happen to be present. Any 
random motion of particles exhibiting itself as heat energy will also make a 
contribution. Equations governing this combined mass flow will now be derived. 

Let S be an inertial frame Ox, x 2 x 3 and let p', p", etc., be the densities of 
inertial mass at a point of the frame due to the various contributors. Then, if v', v", 
etc., arc the respective velocities of flow of these components, the net density of 
linear momentum g will be given by 

g =p'v' -f p"v" + . . . = pv, (21.1) 

where 

p = p' + p" + ... (21.2) 

is the net density of inertial mass. Equation (21.1) defines the mean velocity of 
mass flow v. In time dz, the mass flowing across an area d A having unit normal n is 

p'v' • nd/1 dz + p"v" • ndA dz + . . . = pv • ndA dz (21.3) 

thus, the rate of mass flow across unit area is pv ■ n = g- n, implying that g is also 
the current density vector for the mass flow. The components of g will be written 
g, (Greek indices will range over values 1, 2, 3). 

Let g 1 ' 1 be the current density vector for the flow of the x, -component of 
momentum, i.e, the rate of flow of this component of momentum across unit area 
with unit normal n is g 1 ” ■ n. The x^-component of g 1 ” will be denoted by g, fi . In the 
special case of a cloud of non-interacting particles (no stress field), whose velocity 
of flow is v, since the density of the x x -component of momentum is g,, the quantity 
of this component passing over the unit area in unit time is g, v • n. It follows that 
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g 1 ’ 1 = <y,v and hence, 

(he = (21.4) 

A simple distribution of this type will be referred to as an incoherent cloud. 

For any distribution which includes material particles, in addition to the 
internal forces of interaction between the particles, there may be other forces 
acting upon them due to agents which are regarded as external to the system; such 
forces will be termed external forces. Let do be the volume occupied by a small 
element of the fluid or solid which is formed from these particles; if v is the flow 

velocity of the element and do 0 is its proper volume (i.e. volume measured in a 

frame in which the element is momentarily stationary), then 

do = ,J(l -i 2 /c 2 )da> 0 (21.5) 

since all lengths parallel to the flow will be subject to a Fitzgerald contraction. If 
df is the resultant external 3-force acting on the element, we shall define the 3- 
force density d at the element to be such that df = d do. Similarly, if dF is the 
external 4-force on the element, the 4-force density will be D, where dF = D da> 0 ; 
since daj 0 is a 4-invariant, this defines Dasa4-vector. Reference to equation (18.2) 
shows that we can write 

dF = (1 — r 2 /c 2 ) _ 1 2 (df, idf-v/c) (21.6) 

and this is equivalent to the equation 

Ddw 0 = (1 - v 2 /c 2 )~ 1 2 (d, id - v/c)da> (21.7) 


It now follows from equation (21.5) that 

D = (d, id-v/c) (21.8) 

which relates 3- and 4-force densities. 

Now suppose Z is a closed surface which is stationary relative to the frame S 
and let da be the area of a surface element whose outwardly directed unit normal 
is n. Then the rate of increase of the total mass inside Z must equal the rate of 
inflow of mass across Z, plus the rate at which external forces acting on any 
particles inside Z generate energy (and hence mass) by performing work. Let T 
denote the interior of Z and dot a volume element of T. Then conservation of 
inertial mass is expressed by the equation. 

^ £ fcdut = — j* g-nda + -jj d-v dw (21.9) 

Converting the surface integral into a volume integral over T by the divergence 
theorem, this equation is seen to be equivalent to 
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Since f is arbitrary, this implies that 


eg 1 

~ + 9a. a — 2 

at c 


( 21 . 11 ) 


where the summation convention is being applied to the Greek indices. 

Since the x.-component of the linear momentum of the particles within the 
volume element dot will be increased at a rate d, dcu by the external forces and g"” 
is the current density vector for the flow of this component of momentum, the 
equation corresponding to (21.9) expressing the conservation of linear momen- 
tum is 


d^ 

dr 


I 


9 , dot 


j* g'^-nda-f 


d, dm 
r 


( 21 . 12 ) 


Again, by application of the divergence theorem, we can show that this implies 
that 


eg, , . 

~g~ + 9ati.ii ~ d. 


(21.13) 


These equations (21.11), (21.13), of conservation of inertial mass (or energy) 
and linear momentum can be expressed in four-dimensional form by the 
introduction of Minkowski coordinates x, and by the definition of a 4-tensor T tJ 
according to the equations 

= 9,s, T ,4 = T 4 , = icg„ T 44 = —c 2 /j (21.14) 


It may be verified that, with this notation, the equations reduce to the form 

T„„ = D, (21.15) 

where the 4-force density D, is given by equation (21.8). T tj is called the 
energy-momentum tensor for the mass-energy distribution. By assuming T tj 
behaves like a 4-tensor on transformation between inertial frames, equation 
(21.15) is guaranteed to be valid in all such frames and the special principle of 
relativity is satisfied. 

In the special case of an incoherent cloud of particles flowing with velocity v, we 
have g , = gv„ and g = g,v$ = gv,v^. Equations (21.14) now yield 

T t] = g 00 K V, (21.16) 

where V = (1 - v 2 /c 2 )“ 1,2 (v, ic) is the 4- velocity of flow and 

9oo = d-i-V)^ (21.17) 

Since the density of rest mass of the cloud observed from S is g j (1 —v 2 /c 2 \ the 
rest mass of the particles in the volume element da> is g (\ -v 2 /c 2 ) dco. 
Observed from a frame S 0 in which the particles in this element are momentarily 
stationary, the volume of the element will be dio 0 ; hence, the density of proper 
mass in S 0 is ^ ^/(l - u 2 /c 2 )dio/dco 0 = ^(1 -v 2 jc 2 ) = ji 00 , having used equation 
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(21.5). p 00 is accordingly referred to as the proper density of proper mass of the 
cloud; it is a 4-invariant and equation (21.16) clearly expresses T tj as a 4-tensor. 

In many circumstances, the 4-force density D, of the external force field can be 
expressed as the divergence of a second rank tensor, i.e. we can write 

D, = — Sijj (21.18) 

Equation (21.15) then reduces to 

(7-, + S,,)., = 0 (21.19) 

This shows that the external force field can be treated as an additional component 
of the original mass-energy distribution, contributing its own energy- 
momentum tensor S- , The field is then regarded as possessing energy of density 
c 2 p = -S 44 , x,-component of momentum of density g a = S,Jic, and the 
momentum flow within the field is described by^ = V There being no external 
forces operating on the enlarged system, it is said to be isolated, and if 7y is taken 
to denote the combined energy-momentum tensor, the equations of conservation 
of energy and momentum for the overall distribution take the form 

7^ = 0 (21.20) 

i.e. Tij has a vanishing divergence. 

22. Energy-momentum tensor for a fluid 

In this section we will calculate the equations of motion for an elastic fluid moving 
under the action of an external force field of density d,. From these, the 
energy-momentum tensor for the fluid, including its internal stress field, can be 
derived. 

Let be the stress tensor, i.e. (r l4 , t 2j , r 3a ) are the components of the force 
exerted across unit area, whose normal is parallel to the x„-axis, by the particles on 
the side for which x„ takes lesser values upon the particles on the side for which x, 
takes greater values. Consider a fluid element in the shape of a small tetrahedron, 
three of whose faces are normal to the axes and whose fourth face has unit normal 
n, (Fig. 4). If <5x|, <5x 2 , <5x 3 are the respective lengths of the edges parallel to the 
axes, the stress force acting on the face parallel to the coordinate plane Ox 2 x 3 will 
have components i(r, ,, r 21 , t 31 )<5x 2 <5x 3 ; the forces acting on the two faces 
parallel to the other two coordinate planes can be calculated similarly. Let *5/1 be 
the force applied to the sloping face, <5 A being its area. Then the x a -component of 
the equation of motion of the element is 

ir.,,<5x 2 <5x 3 + jTi,2<5x3<5xi + jt, 3 <5x,<5x 2 + s 3 <5/1 + £<5xi<5x2<5x 3 c/, = 

( 22 . 1 ) 

where p, is the momentum of the element. Since the face of area £<5 x 2 <5x 3 is the 
projection of the area 8 A on to the coordinate plane Ox 2 x 3 , n t 8A = ^<5x 2 <5x 3 ; 



54 



Fig. 4 

similarly, rt 2 SA = j<5xj<5xi, n 2 6A = ^<5xi<5x 2 . Thus, 

s, = -(!,,«,+ r l2 n 2 + T a3 n 3 ) + (terms of third order in dx,)/SA. (22.2) 
In the limit as 8x, -* 0, this gives 

s x = — T i0 n i (22.3) 

Now consider the motion of a small element of fluid, of any shape, bounded by 
a surface I (I moves with the fluid). If do is an element of I (Fig. 5) whose unit 
normal is n„ the force exerted on it by the neighbouring fluid is - r, e n e do and the 
resultant stress force on the complete element is therefore 

- | ^rijd o = - I dw (22.4) 

J T J r cx e 



Fig. 5 
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where T is the interior of I and we have used the divergence theorem. Thus, the 
force density for the stress field is — # . 

Let g„ be the total momentum density for the fluid, including the elastic 
potential energy generated by the stress field. If Sw now represents the volume of 
the fluid element inside Z, the momentum of the element is g^Sto and the rate of 
change of momentum is 

d do, d 

— (gM>) = (22.5) 

where the derivatives are calculated following the fluid motion. Thus, 

d, ( 22 - 6 ) 

where v, is the velocity of flow. During a short time Si, the surface element do 
traces out a volume v-n da St and the increase in the volume of So is accordingly 


: v n da = St di 
Jj; j r 


div v dco = St Soj v„ 


— (^at) = daWf'i, 

Equations (22.5), (22.6) now give for the rate of momentum change 


[i t + * + »r = lir ■ + ix, M r 

We can now write down the equation of motion of the element, viz. 


ej d«= (4- fao. o)d« 


( 22 . 10 ) 


+ (9a l > + 0 — d* 


(22.11) 


At this stage it should be noted that we are disregarding any flow of heat which 
may take place by conduction within the fluid. Such a flow of energy would 
contribute its own momentum and further terms would need to be included in 
equation (22.11) to allow for this. 

We next calculate the equation of energy for the fluid element. The rate at 
which the stress force acting upon da does work is — T^n^da and the total rate 
of doing work by these forces on the element is therefore 

- V'T^tiedo ~l v,T af )d<D (22.12) 

J* h ox > 
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by the divergence theorem. The rate of doing work by the external forces on the 
element is d^v^dw. If g is the density of the total inertial mass of the fluid 
(including the elastic potential energy and any heat generated by compression), 
the energy of the element is c 2 gdu). Thus, the equation of energy is 


— (c 2 ^dw) 




Using equation (22.8) and 


d/i 

dr 


Vfi 

ft 


(22.13) 


(22.14) 


equation (22.13) gives 

+ {/**■> /»}.? = (22.15) 

We now compare the equations (22.1 1), (22.15) with the equations (21.13), 
(21.11), for a general mass energy flow. It is seen that, for the fluid, we must take 


Q, = + 1 2 v » x ts* (22.16) 

d*0 = 9,V0 + *,0 (22.17) 


Substituting for g, from equation (22.16) into (22.17), we get the alternative 
formula 


9<,f = 9^0 + z,0 + pT yl v v v 6 (22.18) 

Equations (21.14) now yield the components of the energy-momentum tensor, 
viz. 


7~,0 — gt*V0 + Zj0 + t ya^yt : 0 


= Ti, = ic(fiv* + L-0i0j 
r 44 = -c 2 g 


(22.19) 


Alternative forms for these components involving the 4-velocity of flow V-, can 
also be found (see Exercise 68 at the end of this chapter). 

Two special cases of these results are of great importance. The first is that of an 
incoherent cloud for which we have = 0. Then T Jt> = gc a v f , 7", 4 = icgv x , 
r 44 = - c 2 g , and using equation (21.17), we derive equation (21.16) again. It 
should, however, be noted that equation (21.17) is not valid in the general case of 
an elastic fluid, since the inertial mass density g includes a component due to the 
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elastic energy and the value of this component in the rest frame cannot be found 
by simple multiplication by (I -v 2 /c 2 ). 

The second special case is that of a perfect fluid. This is defined to be a fluid in 
which there are no shearing stresses in a frame S° relative to which it is at rest. 
Thus, in S°, z% = pd where p is the pressure. Equations (22.19) now give for the 
components of the energy momentum tensor in S° the values 


( 7 -°) = 


p 0 0 0 
0 p 0 0 
0 0 p 0 

0 0 0 ~ c2 Poo 


( 22 . 20 ) 


If V { is the 4-velocity of flow of the fluid, it is now easy to verify that the tensor 
equation 


T, j = (Poo + p/c 2 )V i V i + pd ll (22.21) 


is valid in the frame S°(p ao , p being invariants) and, hence, is valid in all frames. 
Thus, if p is the density of inertial mass in a frame S, 


P = -T^/e 2 


Poo + P / L ' 2 _ P 
1 — v 2 /c 2 c 2 


( 22 . 22 ) 


or P + P/c 2 = (/i 00 + p/c 2 )/(l -v 2 /c 2 ) 


(22.23) 


It now follows that 


j V a 

9, = T^/ic = (poo + P/c h m 

1 - V fC 

= {p + p/c 2 ) V x (22.24) 

Comparing this last equation with equation (22.16), we deduce that pv t = 
identically, which implies that 

t* = (22-25) 

i.c. there is no shearing stress in any frame and the pressure is the same in all 
frames. 


23. Angular momentum 

A particle having momentum p 2 = mv„ at a point x 2 relative to a frame S is defined 
to have angular momentum 

= (x, -<*<.)?<! - 1*0 -<Jp)P„ (23.1) 

about the fixed point a„. Clearly, this defines the angular momentum as an anti- 
symmetric 3-tensor. In elementary mechanics, a more usual definition is by the 
vector product (x-a)xp, i.e. the pseudovector — a e )p y - however, the 

components of this pseudovector are found to be (h 23 , fi 3l ,h l2 ) and these are 
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three of the non-vanishing components of h^fthe other three are h 22 
etc.), so that the definitions are essentially equivalent. 

If j, is the 3-force acting on the particle, then 


dfc* 

df 


dx, dx„ d p„ dp, 

d7 P ' ,- "dr Pl ,Xs_a ’ "d? 


-h 


23» 


= (x„-o a )/,-(x, -a 0 )f, 


(23.2) 


since p, = mv, = mx 2 and p, = f 2 . The right-hand member of the last equation is 
another anti-symmetric 3-tensor called the moment of the force /, about the point 
a 2 . Denoting this moment by m 20 , we have derived the equation of angular 
momentum, viz. 


dt 




(23.3) 


If m 20 = 0, then h 20 is constant and the angular momentum is conserved. 

In the case of the continuous distribution of mass-energy considered in 
section 21, the momentum of an element of volume da> is g 2 dto and the angular 
momentum of the whole system about a, is defined by the equation 


p 

(*«<> = {(Xa-a.tatf-lxp-n^&ldcu 

Jr 


1 

ic 



(x 0 -a 0 )T, i }dw 


(23.4) 


where F is the region occupied by the system and T u is the energy-momentum 
tensor. If the system is imagined to be situated in otherwise empty space, so that 
there is no container exerting forces on its bounding surface, the region T can be 
extended to include the rest of space. In these circumstances, differentiating 
equation (23.4) with respect to t, we find 


dh 

~dt 


— = * ! {x 2 -a 2 )T 0tA -(x 0 -a s )T 0A , A }du) 


-l 

■I 


{ (x, - a,) (D„ -T 0y y ) - (x s - a 0 ) (£>, -F, y y )}dco, 


(x, -ajd 0 -(x 0 -a 0 )d x }dw 

{ [(x, - a«)7> T ]. y - [(x s - a 0 )T ay ],., - x,,„ T 0y + x„. y T, y )da> (23.5) 


having used equations (21.8) and (21.15). Applying the divergence theorem to the 
first two terms involving the energy-momentum tensor, since T h vanishes at a 
great distance from the distribution, these terms make zero contribution. Thus, 
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the equation reduces to the form 

= j" {{x,-a 1 )d^-(x^-a f )d a }da)+ j (T e ,-T^)6w (23.6) 

(Note that x a y = <5, r ) The first integral in the right-hand member of equation 
(23.6) is the moment m ltl of the external field forces about u a and it follows that the 
equation of angular momentum (23.3) is valid for the distribution if u is 
symmetric. Since T, 4 = 7-4,, this condition is equivalent to the requirement that 
the energy-momentum tensor should be symmetric. 

The assumption that T tj is symmetric, and hence that the angular momentum of 
a continuous distribution not acted upon by external forces is conserved, is 
always made and is, indeed, necessary for the development of the general theory 
of relativity (see section 47). Reference to equations (22.19) indicates that this 
assumption implies that the stress tensor for an elastic fluid cannot be 
symmetric as was always assumed in the classical theory; it will, however, be very 
nearly symmetric in the case when its components and the flow velocity are 
sufficiently small. 


Exercises 3 


1. Obtain the transformation equations for v by differentiating the Lorentz 
transformation. 

2. Obtain the transformation equations for the acceleration a by differentiat- 
ing the transformation equations for v and express them in the form 

.. (1 - u 2 /c 2 ) il2 

a x = — -T7T<J 


(l-c x w/c 2 ) 3 

1 -u 2 /c 2 
(1 v x u/c 2 ) 2 

1 - u 2 /c 2 


( a ' 


Vy»/C 
1 v x ujc 2 


(1 - V x u/c 2 ) 2 


v z u/c 

° z + T^v~H/c 2 ’ 


Deduce that a point which has uniform acceleration in one inertial frame has not, 
in general, uniform acceleration in another. 

3. A nucleus is moving along a straight line when it emits an electron. As seen 
from the nucleus, the electron’s velocity is 6c/7 making an angle of 60 c with its 
direction of motion. A stationary observer measures the angle between the lines of 
motion of nucleus and electron to be 30". Calculate the speed of the nucleus. 
(Ans. 3c/5.) 

4. A nucleus is moving with velocity 3c/5 when it emits a /(-particle with 
velocity 3c/4 relative to itself in a direction perpendicular to its line of motion. 
Calculate the velocity and direction of motion of the /3-particle as seen by a 
stationary observer. If the /(-particle is emitted with velocity 3c/4 in such a 



60 


direction that the stationary observer sees its line of motion to be perpendicular to 
that of the nucleus, calculate the direction of emission as seen from that nucleus 
and the velocity of the /f-particle as seen by the stationary observer. (Ans. 3c/5 at 
45 to line of motion; n -a to line of motion where cos a = 4/5, 9c/ 16.) 

5. Show that the 4-velocity V is of constant magnitude ic. 

6. A beam of light is being propagated in the xy-plane of Sat an angle a to the 
x-axis. Relative to S it is observed to make an angle a with Ox. Prove the 
aberration of light formula, viz. 


cot a = 


cot a - (u/c)coseca 
(1 -u 2 /c 2 ) 1 ' 2 


Deduce that, if u c, then 


Ax = a - a = - sin a 
c 

approximately. 

7. A particle of rest mass m 0 is moving under the action of a force f with 
velocity v. Show that 

m 0 dv m 0 vv/c 2 
(1 — v 2 /c 2 ) 1 ’ 2 df + (1 - r 2 /c 2 ) 3;2 V 

Hence, if the acceleration dv/dt is parallel to v, show that 


" (1 — t 2 /c 2 ) 3 ’ 2 dt ’ 
and if the acceleration is perpendicular to v, then 

f m o dv 
(1 -v 2 /c 2 ) 1 ’ 2 dt 

8. Two particles are moving along the x-axis of a frame S with velocities t,, v 2 . 
Calculate the velocity u with which a parallel frame S must move parallel to the x- 
axis of S, if the particles have equal and opposite velocities relative to S Show that 
the magnitude of these velocities is 

c 2 -(c 2 -t’f) ,,2 (c 2 -tl) 1 ' 2 


assuming v, > v 2 > 0. 

9. A bullet of length d is moving with velocity v. The line of sight from a camera 
makes an angle a with the bullet’s velocity. Behind the bullet and parallel to its 
axis is a stationary measuring scale. If the camera takes a photograph of the bullet 
against the background provided by the scale, show that the bullet’s length as it 
appears on the scale is 

(1 -v 2 /c 2 )' l2 d 
1 + u cos a/c 
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10. A cart rolls on a table with velocity kc. A smaller cart rolls on the first in 
the same direction with velocity kc relative to the first cart. A third cart rolls on 
the second with relative velocity kc, and so on up to n carts. If cv r is the velocity of 
the rth cart relative to the table, prove that 

v r + k 


Deduce that 

(1 +k) n - (1 -k) n 
L " ~ (1 +k)" + Jl -k) n 


What is the limit of v„ as n -* oo? (Ans. c.) 

1 1 . A nucleus disintegrates into two parts, A and B which move with equal and 
opposite velocities of magnitude V. A then ejects an electron whose velocity 
observed from A has magnitude Kand direction perpendicular to the direction of 
/I’s motion. Show that, as observed from B, the electron’s velocity makes an angle 
a with the direction of A's motion, where tan a = j(l - V 2 /c 2 ) and calculate the 
magnitude of the velocity of the electron relative to B. (Ans. V{4 + 
(1 - V 2 /c 2 ) 2 }' l2 /(\ - V 2 /c 2 ).) 

1 2. A rocket moves along the x-axis in S, commencing its motion with velocity 
v 0 and ending it with velocity v, . If w is the jet velocity as measured by the crew 
(assumed constant), show that the mass ratio of the manoeuvre (i.e. initial 
mass/final mass) as measured by the crew is 

(c + c,)(c-p 0 ) c ' 2 ’ v 
(c-p,)(c + t> 0 )_ 

What does this reduce to as c -* oo? Deduce that, if the rocket starts from rest in S 
and its jet is a stream of photons, the mass ratio to velocity v is 



Show that, with a mass ratio of 6, the rocket can attain 35/37 of the velocity of 
light in S. 

1 3. S, S, S are inertial frames with their axes parallel. S has a velocity u relative 
to S and S has a velocity v relative to S, both velocities being parallel to the x-axes. 
If transformation from S to S involves a rotation through an angle a of the axes in 
space-time and transformation from S to S a rotation /?, a transformation from S 
to S involves a rotation y where y = a + p. Deduce from this equation the 
relativistic law for the composition of velocities, viz. 

u + v 
1 + uv/c 2 


w = 
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14. A force f acts upon a particle of mass m whose velocity is v. Show that 


15. An electrified particle having charge e and rest mass m 0 moves in a uniform 
electric field of intensity E parallel to the x-axis. If it is initially at rest at the origin, 
show that it moves along the x-axis so that at time f 



where k — eE/m 0 . Show that this motion approaches that predicted by classical 
mechanics as c -* ac . (It may be assumed that the force acting upon the particle is 
eE in the direction of the field at all times.) 

16. A tachyon transmitter always emits a tachyon at a speed v > c relative to 
itself. Observers A, B are equipped with such transmitters and B is moving away 
from A with velocity u < c. A transmits a tachyon towards B, who is at a distance 
das measured from A when he receives it, B immediately transmits a tachyon back 
towards A. Show that A receives this tachyon a time 

~ — .(2 u-v-uh/c 2 ) 
c(r - u) 

after transmitting his own Deduce that A receives the reply before (!) his act of 
transmission if 

v c+J(c 2 -u 2 ) 
c u 

17. v, V are the 3- and 4-velocities of a point. If a = dv/dr is the 3-acceleration 
and A = dV/dr is the 4-acceleration, prove that 

A 2 = (1 -i' 2 /c 2 ) -3 {(c 2 -i- 2 )a 2 +2r'fa-v — y 2 i' 2 }/c 2 

18. A mirror moves perpendicular to its plane with velocity t and away from a 
source of light. A ray from the source is reflected by the mirror. If 0 is the ray’s 
angle of incidence, show that the angle of reflection is <j>, where 

(v 2 +c 2 )cosO -2cv 

COS <j) 2 ~ 2 S h" 

+c — 2t;ccos0 

19. Two trains, each having the same rest length L, are moving in opposite 
directions with equal speeds U on parallel tracks. State the timeTthey take to pass 
one another according to a classical, non-relativistic, calculation. Show that the 
time taken, as measured by a driver of one of the trains and using a relativistic 
calculation, is also T. 

20. v, v are the velocities of a point relative to the inertial frames S, S 
respectively. Representing these vectors as position vectors in an independent 
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show that 


rv = q 


V + u < ■— * (P - \ )+ P 
u 


where p = (1 -u 2 /c 2 ) 12 and Q = 1/(1 + u-v/c 2 ). 
Show further that 


u 2 p v = Q[(l - p)u x (v x u) + pu 2 (u + v)] 
and hence verify that 

v 2 - Q 2 [(u + v) 2 - (v x u) 2 /c 2 ] 


21. A luminous disc of radius a has its centre fixed at the point (x, 0, 0)of the S- 
frame and its plane is perpendicular to the x-axis. It is observed from the origin in 
the S-frame at the instant the origins of the two frames coincide and is measured 
to subtend an angle 2a. Prove that, if x, then 


tana 



( Hint employ the aberration of light formula, exercise 6 above.) 

22. A particle moves along the x-axis of the frame S with velocity r and 
acceleration a. Show that the particle’s acceleration in S is 


__ (\-u 2 /c 2 ) 3 / 2 

a (1 -uv/c 2 ) 2 a 

If the particle always has constant acceleration a relative to an inertial frame in 
which it is instantaneously at rest, prove that 

where ft = (I — v 2 /c 2 )~ 1,2 and t is time in S. 

Assuming that the particle is at rest at the origin of S at l =0, show that its x- 
coordinate at time t is given by 

ax = c 2 [ ( 1 + a 2 i 2 /c 2 )' 2 - 1] 

23. Three rectangular Cartesian inertial frames S, S, S are initially coincident^ 
As seen from S, S moves with velocity u parallel to Ox and, as seen from S, J 
moves with velocity r parallel to Oy. If the direction of S’s motion as seen_from S 
makes an angle f^with Ox and the direction of S's motion as seen from S makes 
an angle <p with Oi, prove that 
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Deduce that, if u, t; < c, then 

<(> -9 = uv/2c 2 

approximately. 

24. The inertial frames S, Shave their axes parallel and the origin of S moves 
along the x-axis of S with velocity u. A rigid rod lies along the x-axis of S and is 
attached to it. IfTis the rod’s length as measured in S and / is its length measured in 
S, show that l =7(1 -u 2 /c 2 ) 112 . S' is a third parallel inertial frame whose origin 
also moves along the x-axis of S. Observed from S', the origins of S and S have 
equal and opposite velocities. Show that the velocity of S' observed from S is 



- (1 -u 2 /c 2 ) 1 ' 2 




A rod, identical to the one already referred to, lies along the x'-axis of S' and 
moves with this frame. Its length observed from S is L. Show that 



25. Two particles, each having rest mass m 0 , are moving in perpendicular 
directions with the same speed |c. They collide and cohere to form a single 
particle. Show that its rest mass is yj(l4/3)m 0 . (Assume there is no radiation of 
energy.) 

26. A particle of rest mass m, and speed v collides with a particle of rest mass 
m 2 which is stationary. After collision the two particles coalesce. Assuming that 
there is no radiation of energy, show that the rest mass of the combined particle is 
M, where 


M 2 


m] 4 - m\ + 


2m l m 2 
(1 -v 2 /c 2 )' 12 


and find its speed. 

27. A particle is moving with velocity u when it collides with a stationary 
particle having the same rest mass. After the collision the particles are moving at 
angles 9 , <t> with the direction of motion of the first particle before collision. Show 
that 

„ 2 
tantuanrf) = 

y+ 1 

where y = (1 — u 2 /c 2 y l/2 . (If c oc, y -» 1 and 9 + 4> = \n. This is the prediction 
of classical mechanics. However, if the particles are electrons and u is near to c in 
value, 9 + 4> < jrr. This effect has been observed in a Wilson cloud chamber.) 
(Hint: Refer the collision to an inertial frame in which both particles have equal 
and opposite velocities prior to collision.) 
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28. A body of mass M disintegrates while at rest into two parts of rest masses 
M, and M 2 . Show that the energies £,, £ 2 of the parts are given by 

^ „ yM 2 -M] + M 2 2 

£, = c , £, = c — 

2 M 2 2M 


29. Two particles having rest masses m,, m 2 are moving with velocities u,, u 2 
respectively, when they collide and cohere. If a is the angle between their lines of 
motion before collision, show that the rest mass of the combined particle is m, 
where 


m 


2 


m\ +m\ + 


2 m,m 2 (c 2 — Mucosa) 
V{(c 2 - uf) (t 2 — u|)} 


Show that, for all values of a, m 5m, + m 2 and explain the increase in rest mass. 

30. A photon having energy £ collides with a stationary electron whose rest 
mass is m 0 . As a result of the collision the direction of the photon's motion is 
deflected through an angle 0 and its energy is reduced to £’. Prove that 


m 0 c 2 



= 1 -COS0 


Deduce that the wavelength A of the photon is increased by 

AA = -^-sin 2 jd, 
m 0 c 


where h is Planck’s constant. (This is the Compton effect. For a photon, take 
A = he /E.) 

31. A particle P having rest mass 2 m 0 collides with a stationary particle Q 
having rest mass m 0 . After the collision, the rest mass of Q is unchanged, but the 
rest mass of P has been reduced to m 0 . If the lines of motion of the two particles 
after the collision both make an angle of 30° with the original line of motion P, 
calculate the original velocity of P and the momentum acquired by Q. (Ans. 
3^50/7; yj 1 5m 0 c.) 

32. A particle is moving with velocity v when it disintegrates into two photons 
having energies £,, £ 2 , moving in directions making angles a, with the original 
direction of motion and on opposite sides of this direction. Show that 


tan^atanz/? 


c - v 
c + v 


Deduce that, if a photon disintegrates into two photons, they must both move in 
the same direction as the original photon. 

33. A stationary particle having rest mass 3 m 0 disintegrates into a pair of 
particles, each of rest mass m 0 , and a neutrino. The directions of motion of the 
particle pair are at an angle 26, where cos# = 1/3. Calculate the energy of the 
neutrino and show that the speed of each of the other two particles is 3c/5. (Ans. 
?m 0 c 2 .) 
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34. A cosmic ray particle has rest mass m 0 and is moving with velocity 3c/5 
relative to a stationary observer. It is seen by this observer to emit a gamma ray 
photon with energy m 0 c 2 / 4 in a direction making an angle of 60 with its original 
line of motion. Show that the rest mass of the particle is reduced by a quarter and 
calculate the angle through which its velocity is deflected and its new speed. (Ans. 
tan" 1 ( V3/5); Jlc/4.) 

35. A neutron having rest mass is stationary when it disintegrates into a 
proton (rest mass m P ), an electron (rest mass m E ) and a neutrino. The proton 
moves in the opposite direction to the other two particles, which move along the 
same straight line. If 7" is the kinetic energy of the proton, prove that the kinetic 
energy of the electron is c(m E c -k) 2 /2k, where 

T 

k = (m N -m P )c -- - yJ(2m P T +T 2 /c 2 ) 

36. A particle having rest mass m 0 is at rest when it emits two photons, each of 
energy jm 0 c 2 . The particle recoils with rest mass |m 0 along a line bisecting the 
angle between the tracks of the photons. Calculate the angle between these tracks 
and the particle's velocity of recoil. If the photons are observed by an observer 
moving with the particle, show that the angle between their tracks is seen to be 2a, 
where sina = 1/7. (Ans. 30°; sJic/2.) 

37. A nucleus has rest mass M. Whilst at rest, it emits a photon. If the internal 
energy of the nucleus is reduced by £ 0 in the process, show that the energy of the 
photon is £, where £ = E 0 (l -E 0 /2Mc 2 ). 

38. The lines of motion of a particle having rest mass m 0 and a photon are 
perpendicular to one another. The total energies of the particle and photon are £. 
£ respectively. If the particle absorbs the photon, show that its rest mass is 
increased to M 0 , where = ml + 2EE/c*. 

39. A photon having energy £ is moving along the x-axis when it encounters a 
stationary particle having rest mass m 0 . The particle absorbs the photon and then 
emits another photon having the same energy in a direction parallel to the y-axis. 
Calculate the direction and magnitude of the final momentum of the particle and 
show that its rest mass is reduced to the value ^ (ml —2 E 2 /c*). 

40. A mass 3/.m(A > 1 ) at rest disintegrates into three fragments (each of rest 
mass m) which move apart in directions making equal angles with each other. 
Show that, in a frame of reference in which one of the fragments is at rest, the 
angle between the directions of motion of the other two fragments is 
2 cot - 1 (73/-). 

41. A positron travelling with velocity 3c/5 is annihilated in a collision with a 
stationary electron, yielding two photons which emerge in opposite directions 
along the track of the incoming particle. If m is the rest mass of the electron and 
positron, show that the photons have energies 3mc 2 /4 and 3mc 2 /2. 

42. A positron having momentum p collides with a stationary electron. Both 
particles are annihilated and two photons are generated whose lines of motion 
make equal angles a on opposite sides of the line of motion of the positron. Prove 
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that p sin a tana = 2 me where m is the rest mass of both the positron and electron. 
If y. = 60 , calculate the velocity of the positron. (Ans. 4c/5.) 

43. A particle of rest mass m 0 collides elastically with an identical stationary 
particle and, as a result, its motion is deflected through an angle 0. If Tis its KE 
before the collision and T is its KE afterwards, show that 


‘ \ +Tsin 2 6/2m 0 c 2 

44. A particle of rest mass m, collides elastically with a stationary particle of 
rest mass m 2 ( < m,| and, as a result, is deflected through an angle 0. If E, £’ are the 
total energies of the particle m, before and after collision respectively, prove that 

n (£ + m 2 c 2 )E' — m 2 c 2 E — m\c* 

= 

45. A pion having rest mass m 0 is moving along the x-axis of an inertial frame 
Oxyz with speed 4c/5, when it disintegrates into a muon having rest mass 2m 0 /3 
and a neutrino. The neutrino moves parallel to the y-axis. Prove that the angle 
made by the muon’s velocity with the x-axis is tan “ 1 ( 1/8) and calculate the energy 
of the neutrino. (Ans. m 0 c 2 / 6.) 

46. A nucleus having rest mass m 0 disintegrates when at rest into a pair of 
identical fragments of rest mass \/.m 0 (A < 1 ). Show that the speed of one particle 
relative to the other is 2^/(1 — A 2 )c/(2 - A 2 ). If A is small, show that this speed is 
less than c by a fraction a 4 /8. 

47. A positron collides with a stationary electron and the two particles are 
annihilated. T wo photons are generated, the lines of motion of which make angles 
of 30' and 90 c with the original line of motion of the positron. Calculate the 
original velocity of the positron and show that the energy of one of the photons is 
equal to the internal energy of the electron. (The rest masses of an electron and a 
positron are equal.) (Ans. yJ3c/2.) 

48. A moving positron collides with a stationary electron. Both particles are 
annihilated and two photons are generated, the lines of motion of which both 
make angles of 60 ’ with the original line ol motion of the positron. Prove that the 
total energy of each photon is 4m 0 r 2 /3, where m 0 is the rest mass of the positron 
and of the electron. 

49. A nucleus having rest mass m 0 is moving with velocity 4c/5 when it emits a 

photon having energy m 0 c 2 / 3 in a direction making an angle of 60 c with the line of 
motion of the nucleus. Show that the subsequent direction of motion of the 
nucleus makes an angle tan - 1 3/7) with its initial direction of motion and 

calculate the new rest mass of the nucleus. Show that, relative to an inertial frame 
in which the nucleus was initially at rest, the line of motion of the photon makes 
an angle of 120 c with the original direction of motion of the nucleus. (Ans. 
V 1 3m 0 /6.) 
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50. A pion has rest mass m 0 and momentum p when it disintegrates into a pair 
of photons having energies £ and £'. The directions of motion of the photons are 
perpendicular and that of the photon having energy £ makes an angle x with the 
original direction of motion of the pion. Prove that 

p = m 0 c(sin2a)“ 1 ,2 , £ = m 0 c 2 y/({ cot x), 

£' = m 0 c 2 tan x). 

5 1 . A particle having rest mass m 0 is moving with an unknown velocity when it 
absorbs a neutrino whose energy is 3m 0 c 2 /2. The angle between the paths of the 
particle and neutrino is x, where cos a = 1/3. After absorption, the rest mass of the 
particle is 2 m 0 . Calculate the original velocity of the particle and show that its 
path is deflected through an angle /?, where tan/? = 4 J2/5, as a result of the 
encounter. (Ans. 3c/5.) 

52. A particle whose rest mass is m 0 moves along the x-axis of an inertial frame 
under the action of a force 

2ffi 0 c 2 a 

(a-x) 2 

At time f = 0, the particle is at rest at the origin O. Show that the time taken for 
the particle to move from O to a point x ( < a) is given by 

0. + 3.) 

53. Oxy are rectangular axes of an inertial frame. A particle having rest mass 
m 0 is projected from the origin with momentum p 0 along Ox. It is acted upon by a 
constant force / parallel to Oy. Show that its path is the catenary 



where wl = m^c 4 + pgC 2 . 

54. A particle having rest mass m 0 moves along a straight line under the action 
of a frictional force of magnitude m 0 v/k opposing its motion; v is the speed of the 
particle and k is a constant. Show that the time which elapses whilst the particle’s 
velocity is reduced from 4c/5 to 3c/5 is [log (3/2) + 5/12]fc. 

55. A particle having rest mass m 0 moves on the x-axis under an attractive force 
to the origin of magnitude 2m 0 c 2 /x 2 . Initially it is at rest at x = 2. Show that its 
motion is simple harmonic with period 4n/c. 

56. A space ship, with its motors closed down, is moving at high velocity v 
through stationary interstellar gas which causes a retardation as measured by the 
crew of magnitude av 2 . Show that the distance it moves through the gas whilst its 
velocity is reduced from V to U is 


1 

a 


1 l, 1+x 

- — 2 log-: — - 

X 1 — X 


V 

U 


where x = (1 -v 2 /c 2 ) 1 ' 2 . 
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57. A particle has rest mass m 0 and 4-momentum P. An observer has 4-velocity 
V in the same frame. Show that, for this observer, the particle’s: 

(i) energy is - P- V; 

(ii) momentum is of magnitude ^/[P 2 -t- (P- V) J /c 2 ]; 

(iii) velocity is of magnitude ^/[l +c 2 P 2 /(P-V) 2 ]. 

(Hint: All these expressions are invariant.) 

58. A particle has rest mass m 0 and moves along the x-axis under the action of a 
force given at any point having coordinate x by 

m 0 c 3 cu 2 x 

^ (c 2 - co 2 a 2 + co 2 x 2 ) 3/2 

co and a being constants. It is projected from the origin with velocity too. Show 
that its velocity at any later time is given by t> 2 = co 2 (a 2 — x 2 ). What does this 
imply for the particle’s motion? 

59. A particle of rest mass m 0 moves along the x-axis of an inertial frame under 
the action of a force 

f __ m o c 2 
J 2(1 +2 x 1,2 ) 3/2 

At time i = 0, the particle is at rest at the origin. Show that, at any later time t, its 
coordinate is given by 

x = 2 + ct -2^/(1 -t -ct) 

60. A particle of rest mass m 0 moves under the action of a central force, (r, 6) 
are its polar coordinates in its plane of motion relative to the force centre as pole. 
V(r) is its potential energy when at a distance r from the centre. Obtain Lagrange’s 
equations for the motion in the form 

d 1 d , • 

— (yr) - yr6 2 + — V' - 0, — (yr 2 9) = 0 
dt m 0 df 

where y = [1 - (r 2 + r 2 0 2 )/c 2 ] ' 1 12 . Write down the energy equation for the 
motion and obtain the differential equation for the orbit in the form 




C-V 


V 


m},c- 


where u = 1/r and h , C are constants. In the inverse square law case when V = 
— ft/r, deduce that the polar equation of the orbit can be written 

lu = I + e cos iff? 

where rj 2 = 1 ~n 2 /mlh 2 c 2 . If n/m 0 hc is small, show that the orbit is ap- 
proximately an ellipse whose major axis rotates through an angle nn 2 /mlh 2 c 2 per 
revolution. 

61. A particle, having rest mass m 0 , is at rest at the origin of the x-axis at time 
t = 0. It is acted upon by a force /, directed along the positive x-axis, whose 
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magnitude when the particle’s velocity is v is given by / = m 0 kc 2 /v. Show that at 
time t( > 0), v = c sin#, where 6 is positive acute and stcd = 1 + kt. Deduce that, 
at the same time, the coordinate of the particle is given by x = c(tan# — 6)/k. 

62. If v is the 3-velocity of a particle and j? = (1 — v 2 /c 2 )~ '' 2 , prove that v v 
= vv and 


v^(/?v) = p 3 vv 

If m 0 is the particle’s rest mass, define the 3-force f acting upon it and deduce from 
the above result that v-f = me 2 , where m is the inertial mass. 

63. A particle having rest mass m 0 moves along the x-axis under a force of 
attraction towards the origin - m 0 a> 2 x. It is initially at rest at the point x = a. 
Show that the velocity with which it passes through the origin is 

awe N /(4e 2 +a> 2 a 2 ) 

2c 2 + a) 2 a 2 

64. If the force f always acts along a normal to a particle’s path, show that the 
speed v of the particle is constant. Write down the equation of motion of the 
particle and deduce that the curvature of the path is given by k = //mi' 2 . If the 
particle moves in a circle of radius a under a constant radial force /, show that its 
speed v is given by 


v 2 = 2c 2 /.[ ^/(A 2 + 1 ) - AJ 

where X = fa/2m 0 c 2 and m 0 is the particle’s rest mass. 

65. A nucleus is moving along a straight line when it ejects an electron. As 
measured by a stationary observer, the speed of the electron is and the angle 
between the lines of motion of the nucleus and electron is 60'. If the speed of the 
electron relative to the nucleus is also jc, calculate the speed of the nucleus. (Ans. 
8c/17.) 

66. A particle having rest mass m 0 , initially at rest at the origin of an inertial 
frame, moves along its x-axis under the action of a variable force/directed along 
the axis and given by the formula / = m 0 c 2 / 2 ^/(l + x). Show that the particle's 
velocity v is given by v = x ,,2 c/( 1 4- x) 1 /2 . Putting x = sinh 2 fl, if r is the time and t 
= 0 at O, prove that < t = 0 + sinhOcoshtf 

67. A particle having rest mass m 0 is moving with speed jc when it is subjected 
to a retarding force. When the particle’s inertial mass is m, the magnitude of the 
retarding force is am 2 (a is constant). Show that the time needed by the force to 
bring the particle to rest is nc/6m 0 <x. 

68. If 7/ is the energy-momentum tensor for an elastic fluid and V t is its 4- 
velocity of flow, by verifying the equation 7j y Vj = - c 2 /i 00 V, in a frame in which 
the fluid is momentarily at rest, prove it in any frame. Deduce the equations 

ffa = (/^OO^ + r,„iyc 2 )/(l - V 2 /C 2 ), V = Hoo + 9^JC 2 
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Hence derive the following formulae for the elements of Tj/. 

= PoO Kj + T a0 + T IV 

7”i4 = T i3 = f. i 00 V t + Vf V Jc 2 

Ttt^fiooVtVt-^V'V'/c 2 

69. A perfect fluid is streaming radially outwards across the surface of a sphere 
with radius R and centre O. If the motion is steady and there is no external force 
field, show that equation (21.20) leads to the equations 

!v»A) + r^-=0, r^ + 3A = 0 

dr dr dr 

where r is radial distance from O, p is the pressure, v is the speed of flow and X 
— (Poo + P/c^Ml -u 2 /c 2 )- If p vanishes over the sphere r = R and p = P at 
great distances, and if p 00 is constant outside the sphere, show that in this region 

P = (P + c 2 p ooW* 1 -rV 2 ) -c 2 Poo 
r 2 e(l — v 2 /c 2 )~ = R 2 sJ{P 2 +2Pc 2 p 00 )/cp 00 

70. A straight rod has cross-sectional area A and mass m per unit length. It lies 
along the x-axis of an inertial frame in a state of tension F. Show that the 
energy momentum tensor has components which are the elements of a 4 x 4 
diagonal matrix, with diagonal elements ( - F/A , 0, 0, - me 2 /A). Deduce that an 
observer moving along the x-axis with speed u, sees the inertial mass per unit 
length of the rod to be 

m - Fu 2 /c 4 
1 - u 2 /c 2 


Deduce that F cannot exceed me 2 . 

71. Assuming that the energy-momentum tensor T tJ is a tensor with respect to 
a general Lorentz transformation x f = a, 7 x ; + write down the transformation 
equations for T ti in the special case where = a j4 =0, a 44 = 1. Deduce that 

are 3-tensors with respect to a simple rotation of the frame Ox,x 2 x 3 
without relative motion. 

72. Relative to a frame S, a fluid has flow velocity (u, 0, 0) at a certain point. In 
the frame S° relative to which the fluid is stationary at the point, the stress tensor 
has components and the fluid density is p 00 . Show that the energy-momentum 
tensor in the frame S has components 


T _ T l 1 + PooU 2 
" “ 1 -u 2 /c 2 ’ 

T l3 = (I -U 2 /C 2 )~ 1 ' 2 1?3, 


Tn — (1 —U 2 / C 2 )~ 


1/2 


T 


0 

12 ' 


_ c 2 Poo + T? | iu 
1 — u 2 /c 2 c ’ 


T 2l = (1 -u 2 /c 2 ) ,,2 tS I( 


^22 — T 22' P 2 3 — r 23' 
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T 2 4 = (1 -u 2 /c 2 ) li2 x° l2 iu/c, r 31 = (1 -u 2 /c 2 )“ I;2 tS,, 

^32 = t 32 > ^33 = * 33 ' 7~34 = (1 — ^ 1 ^) * ' 2 T ? 3 *“/<■'» 

■r _ C 2 H 00 + T° u u 2 /c 2 

* 44 * 2/2 * 

1 - U £ JC 

and deduce that 

T n=*n, Ti: = (! -“ J /e 2 )' 1,2 t? 2. t,3 = (1 -u 2 /c 2 y i,2 t?j, 

*21 = (1 -U 2 /C 2 ) 1/2 t5,, T 22 = T? 2 , *23 = * 23 ' 

*31 = (1 — U 2 ^ 2 ) 1 2 t 3 I » *32 = * 32 > T 33 = * 33 ' 

= (Poo + T? , U 2 /c*)/( 1 - K 2 /C 2 ). 



CHAPTER 4 


Special Relativity Electrodynamics 


24. 4-Current density 

In this chapter we shall study the electromagnetic field due to a flow of charge 
which will be assumed known. Relative to an inertial frame S, let p be the charge 
density and v its velocity of flow. Then, if j is the current density, 

j = pv (24.1) 

Assuming that charge can neither be created nor destroyed, the equation of 
continuity 

divj + ^ = 0 (24.2) 

ct 

will be valid for the charge flow in S. This equation must be valid in every inertial 
frame and hence must be expressible in a form which is covariant with respect to 
orthogonal transformations in space time. Introducing the coordinates x< by 
equations (4.4) and employing equation (24.1), equation (24.2) is seen to be 
equivalent to 

■”> 

j— (P»,) + — ■ + (pv z ) + ~(icp) = 0 (24.3) 

cx, cx 2 <?x 3 cx 4 

This equation is covariant as required if {pv x , pv y , pv z , icp) are the four 
components of a vector in space-time. For, if J is this vector, equation (24.3) can 
be written 

J t . i = 0 (24.4) 

and this is covariant with respect to orthogonal transformations. Now, by 
equation (15.6), 

J = (pv, icp) = p(l — p 2 /c 2 )' 2 V (24.5) 

where V is the 4-velocity of flow and hence J is a vector if p(l -r 2 /c 2 )' 2 is an 
invariant. Denoting the invariant by p 0> we have 


P = 


P o 

(1 -u 2 /c 2 ) 1,2 


(24.6) 
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It follows that p = p 0 if v = 0 and hence that p 0 is the charge density as measured 
from an inertial frame relative to which the charge being considered is 
instantaneously at rest. p 0 is called the proper charge density. 

J is called the 4-current density and it is clear from equation (24.5) that 

J = Po v = (j. KP) l 24 - 7 ) 

It is now clear that, when J has been specified throughout space- time, the charge 
flow is completely determined, for the space components of J fix the current 
density and the time component fixes the charge density. Hence, given J, the 
electromagnetic field must be calculable. The equations which form the basis for 
this calculation will be derived in the next two sections. 

Let dar 0 be the volume of a small element of charge as measured from an 
inertial frame S 0 relative to which the charge is instantaneously at rest. The total 
charge within the element is p 0 da> 0 . Due to the Fitzgerald contraction, the volume 
of this element as measured from S will be da>, where 

da) = (1 — v 2 /c 2 )' 2 da> 0 (24.8) 

The total charge within the element as measured from S is therefore 

pdw = p( 1 - v 2 /c 2 )' 2 dw 0 = /> 0 dco 0 (24.9) 

by equation (24.6). It follows that the electric charge on a body is invariant for all 
inertial observers. 


25. 4-Vector potential 

In classical theory, the equations determining the electromagnetic field due to a 
given charge flow are Maxwell’s equations (3.1) (3.4). To ensure covariance of the 
laws of mechanics with respect to Lorentz transformations, it proved necessary to 
modify classical Newtonian theory slightly. However, it will be shown that 
Maxwell’s equations are covariant without any adjustment being necessary. 
Indeed, the Lorentz transformation equations were first noticed as the transform- 
ation equations which leave Maxwell’s equations unaltered in form. 

To prove this, it will be convenient to introduce the scalar and vector potentials, 
4> and A respectively, of the field. It is proved in textbooks devoted to the classical 
theory (Coulson and Boyd, 1979) that A satisfies the equations 

1 ccb 

div A = 0 (25.1) 

c 2 £t 

1 3 2 A 

v 2 A-- 2 = -f<oj (25.2) 


and tp satisfies the equation 


V 2 0 


i ^ 2 d> 


-P/« 0 


(25.3) 
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where c 2 = l/^ 0 r; 0 . We now define a 4-veclor potential Q in any inertial frame S 
by the equation 

G = (A.«*/r) (25.4) 


It is easily verified that equations (25.2), (25.3) are together equivalent to the 
equation 

CPO = -/i 0 J (25.5) 

where the operator Q 2 is defined by 


- t: 2 + ^”5 


C“ c 

2 4" ^ 2 4" ~ 2 


(25.6) 


The space components of equation (25.5) yield equation (25.2) and the time 
component, equation (25.3). If (2„ J, are the components of O and J respectively, 
equation (25.5) can be written 

«,,, = ~ Ho J, (25.7) 

in which form it is clearly covariant with respect to Lorentz transformations 
provided Q is a vector. This confirms that equation (25.4) does, in fact, define a 
quantity with the transformation properties of a vector in space-time. 

Next, it is necessary to show that equation (25.1) is also covariant with respect 
to orthogonal transformations in space-time. It is clearly equivalent to the 
equation 

div£)=G i( = 0 (25.8) 


which is in the required form. 

J being given, Q is now determined by equations (25.7) and (25.8). 


26. The field tensor 


When A and (pare known in an inertial frame, the electric and magnetic intensities 
E and B respectively at any point in the electromagnetic field follow from the 
equations 

E = -grad <f> (26.1) 

c( 

B = curl A (26.2) 


Making use of equations (4 4) and (25.4), these equations are easily shown to be 
equivalent to the set 


i r 

5Q> 

cQ x 't 

E x 

c 

dx x 

cx 4 

i 

dQ A 

dQ 2 

C 

CXi 

1 * 

X 

1^ 

1 

— -E. 
c ‘ 

cQ a 

CX 3 

dQy 
cx 4 , 


(26.3) 
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cO 3 


cx 2 


cQ t 

r*l I 

C! 

] 

1 

rx 3 

CX i 

cQi 

JO, 

CX j 

dx 2 j 


(26.4) 


Equations (26.3) and (26.4) indicate that the six components of the vectors iE/c, 
B with respect to the rectangular Cartesian inertial frame S arc the six distinct 
non-zero components in space time of the skew-symmetric tensor £), , -Q, j. 
We have proved, therefore, that equations (26.1), (26.2) are valid in all inertial 
frames if 


/ 0 B z - B y 

/ -B, 0 B x 

l B, -B x 0 
\iEJc iE y /c iEJc 


- iEJc \ 

- ‘Ey/C 

- iEjc 


(26.5) 


is assumed to transform as a tensor with respect to orthogonal transformations in 
space- time. The equations (26.3) and (26.4) can then be summarized in the tensor 
equation 


F„= (26.6) 

F,j is called the electromagnetic field tensor. The close relationship between the 
electric and magnetic aspects of an electromagnetic field is now revealed as being 
due to their both contributing as components to the field tensor which serves to 
unite them. 

Consider now equations (3.2) and (3.3). Employing the field tensor defined by 
equation (26.5) and the current density given by equation (24.7), and recalling that 
B = Fo H, D = £ 0 E, these equations are seen to be equivalent to 


or, in short. 


CF .2 cF u cF 14 'N 

~ + = /<0 J 1 

CX 2 €7X3 CX x 

cF 2 1 cF is &F 2i 

— j_ f — — g 0 J 2 

C'X, CX } t ; X 4 


cF 31 + SF 3 2 ^ cF 34 
rx, cx 2 c’x 4 

cF 4 1 ^ cF 42 ^ <?/• 43 

rX| <?x 2 Sx 3 


> 

= Bo^s 
— Bo^a 


F t j. j Bo 


(26.7) 


(26.8) 
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an equation which is covariant with respect to Lorentz transformations. 
Finally, consider equations (3.1) and (3.4). These can be written 


34 C'F 42 + ?F 2i 
dx 2 CX 3 CX 4 


cF 4 , | cF , 3 | ?F 34 
cx 3 <?x 4 ex | 


= 0 


24 ^ t?F 4 1 
cx 4 <?x, cx 2 




£F„ cF 31 ^ cF , 2 _ Q 

<?X, CX 2 £Xj , 


(26.9) 


These equations are summarized thus: 

Fv.t + F^. + Fw.y- 0 (26.10) 

If any pair from are equal, since F tj is skew-symmetric, the left-hand member 
of this equation is identically zero and the equation is trivial. The four possible 
cases when i, j, k are distinct are the equations (26.9). Equation (26.10) is a tensor 
equation and is therefore also covariant with respect to Lorentz transformations. 
To sum up. Maxwell’s equations in 4-dimensional covariant form are: 


F„, j = FoJi) 

F a. * + F Jk , + F kt , = 0 j 


(26.11) 


Given Jj at all points in space-time, these equations determine the field tensor F i2 . 
The solution can be found in terms of a vector potential which satisfies the 
following equations: 

. , = 0 

Q.jJ = -Ho J i 


(26.12) 


Q, being determined, F ;j follows from the equation 


F a = Qj. i ~ ^i. i 


(26.13) 


27. Lorentz transformations of electric and magnetic vectors 


Since F fJ is a tensor, relative to the special Lorentz transformation (5.1) its non- 
zero components transform thus: 


F 2i = F 23 

F 3 , = F 3 , cos a + F 34 sin a 
F"i 2 = F^cosa + F^sina 
Fl 4 = F| 4 

F 24 = ~ F 21 sin a + F 24 cos a 
F 34 = — F 31 sin a + F 34 cos a 


(27.1) 


(27.2) 
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Substituting for the components of F tj from equation (26.5) and for sin x, cos a 
from equations (5.7), the above equations (27.1) yield the special Lorentz 
transformation equations for B, viz. 

B x = B x , By = p{B y + (u/c 2 )E z ), B 2 = d(B z -[u/c 2 )E y ) (27.3) 
Similarly, equations (27.2) yield the transformation equations for E. viz. 

E x = E x , E y = P(E y - uB 2 ), E z = li(E : + uB y ) (27.4) 

The inverse equations can be written down by exchanging ‘barred’ and 'unbarred' 
symbols and replacing u by — u. 

As an example of the use to which these transformation formulae may be put, 
consider the electromagnetic field due to an infinitely long, uniformly charged 
wire lying at rest along the x-axis of the inertial frame S. If q is the charge per unit 
length, it is well known that the electric intensity is everywhere perpendicular to 
the wire and is of magnitude q/(2nc 0 r), where r is the perpendicular distance from 
the wire. Thus, at the point (x, y, z), the components of E are given by 


E x = 0 , 


£ = VL 

y 2m 0 (f 


£= m 

-f-z 2 )’ z 2ne. 0 {y 2 


+ r) 


(27.5) 


The magnetic induction vanishes. 

The electromagnetic field observed from the parallel inertial frame S (relative 
to which S has velocity (u, 0, 0) ) is given by the inverses of equations (27.3) and 
(27.4) to have components 


= n r = P- qZ r - H uqy ..- 

” 2ne 0 c 2 (y 2 +z 2 )' z 2nc 0 c 2 (,y 2 + z 2 ) 

£,,0. e ,_ 

27t£ 0 (y 2 +z 2 ) 2 tc 0 (>' 2 + z 2 ) 


(27.6) 


at the point (x, y, z) (having used the transformation equations y = y, z = z). A 
segment of the wire having unit length in S will appear in S to have length 
■JO -u 2 /c 2 ); however, the charge on the segment must be the same in both 
frames, viz. q. It follows that the charge per unit length as observed in S is q = ) 3q. 
Thus, the charge which flows past a fixed point on the x-axis of S in unit time will 
be (jucj = i; i therefore measures the current flowing along this axis. Since c 1 
= 1/r 0 £ o. equations (27.6) can now be written 


B, = 0, B, 


E x = 0 , Ey - ; 


Fo>z g Bo O' 

2jr(y 2 + z 2 )’ 1 2it(y 2 + z 2 ) 


<ty 




(27.7) 


2nc 0 (y 2 +z 2 )' 1 2itCo(y 2 + z 2 ) J 

These equations imply that the magnetic induction is of magnitude n 0 i/2nr and 
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that the B-lines are circles with centres on Ox and planes parallel to Oyz. This 
result for a long straight current is a well-known one in the classical theory. The 
electric intensity is of magnitude q/2nc 0 r and is directed radially from the wire; 
however, in the case of a current due to the flow of negatively charged electrons in 
a stationary wire, this field is cancelled by the contrary field due to the positive 
charges on the atomic nuclei. 

28. The Lorentz force 

We shall now calculate the force exerted upon a point charge e in motion in an 
electromagnetic field. 

At any instant, we can choose an inertial frame relative to which the point 
charge is instantaneously at rest. Let E 0 be the electric intensity at the point 
charge relative to this frame. Then, by the physical definition of electric intensity 
as the force exerted upon unit stationary charge, the force exerted upon e will be 
cE 0 . It follows from equation (18.2) that the 4-force acting upon the charge in this 
frame is given by 

F =» (cE 0 , 0) (28.1) 

The 4-velocity of the charge in this frame is also given by 

V = (0, ic) (28.2) 

and hence, by equation (26.5), 

eF a V i = e ( E *o, 0) = (eE 0 , 0) (28.3) 

It has accordingly been shown that, in an inertial frame relative to which the 
charge is instantaneously stationary, 

Fj = eF.jVj (28.4) 

But this is an equation between tensors and is therefore true for all inertial frames. 

Substituting in equation (28.4) for the components F { , Fij, V i from equations 
(18.2), (26.5) and (15.6) respectively, the following equations are obtained: 

f x = e{B, v y - B y v, + E x ) 

f, = e(B x v z - B z c x + E y ) (28.5) 

fz = e(B y v x - B x v y + E.) 

These equations are equivalent to the 3-vector equation 

f = e(E + v x B) (28.6) 

f is called the Lorentz force acting upon the charged particle. 

29. The energy-momentum tensor for an electromagnetic field 

Suppose that a charge distribution is specified by a 4-current density vector J. If 
dai 0 is the proper volume of any small element of the distribution and p 0 is the 
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proper density of the charge, the charge within the element will be p 0 da> 0 . It 
follows from equation (28.4) that the 4-force exerted upon the element by the 
electromagnetic field is given by 

Fi = PoFijVjduo (29.1) 

V being the 4-velocity of flow for the element. Employing equation (24.7), this last 
equation can be written 

F, = F^Jj dto 0 (29.2) 

and it follows from the definition given in section 21 that the 4-force density for 
the electromagnetic field is given by 

0, = F it J i (29.3) 

Substituting for J s from the first of equations (26.1 1), we can express £)* in terms 
of the field tensor thus: 

0, = -F,/** (29.4) 

Fo 

We will now prove that the right-hand member of this equation is, apart from 
sign, the divergence of a certain symmetric tensor S fJ given by the equation 


HoS.j = F ik Fj k -ldijF u F k , (29.5) 

and called the energy -momentum tensor of the electromagnetic field. 

Taking the divergence of S (> we have 



FoSij.j — F ik j F jk + f — %&ijF kiFu.j 

(29.6) 

Now 

F ik.jF jk F ii k b kj — F k F jk 

(29.7) 

since F tJ is 

skew-symmetric. Thus 



Fik.jF jk = j{F ik] + F Jtk )F Jk 

(29.8) 

Also 

dijF k iF u j = F k ,F U i = — h jk F kJ , 

(29.9) 

and it follows from these results that the first and last terms of the right-hand 
member of equation (29.6) can be combined to yield 


z(f i*,/ + Fjj k + F k j i)Fj k 

(29.10) 

and this is 
Hence 

zero by the second of equations (26.11). 



6'i 7 , j — F ik F jkj — F ik F kj j — — £>, 

Fo Fo 


(29.11) 
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Substituting for the components of the field tensor from equation (26.5), the 
components of S 0 are calculable from equation (29.5) as follows: If a, jS take any 
of the values 1, 2, 3, then writing E t for E x , E 2 for E y , etc., 

S„, = - (e 0 E x E„ + g 0 H,H f ), a f ft (29.12) 

If / =7 = 1 , 

= — (A'oWi + £o£i) + i(^o ^ 2 + E o E 2 ) (29.13) 

S 2 2 . $33 may be expressed similarly and therefore, in general, if a, /), = 1, 2, 3, 

S x p = — (eqE^Eq + + \6 1 ^(CoE 2 + g 0 H 2 ) (29.14) 

Apart from sign, this is Maxwetfs stress tensor t, ; . t tj is only a tensor with respect 
to rectangular frames stationary in the inertial frame being employed. 

Also, if a = 1, 2, 3, 

S xi = St x = -(E 2 H i -E i H 2 ,E i H l -E t H 3 ,E l H 2 -E 2 H l ) 
c 

= - E x H = * S (29.15) 

c c 

where S is Poynting's vector. 

Finally, 

S 44 = - i (£ 0 £ 2 + ^0 W 2 ) = - U (29.16) 

where U is the energy density in the electromagnetic field. 

These results may be summarized conveniently by exhibiting the components 
of Sjj in a matrix thus: 



We can now write down the equation of motion for a charge cloud moving 
under the action of the electromagnetic field it generates. If T tj is the kinetic 
energy-momentum tensor for the cloud, equations (21.15) and (29.1 1) show that 
the equation of motion can be written 

T,,.,= -S„ w (29.18) 

or 

(7^> + S i; ), i = 0 (29.19) 

i.e. the divergence of the total energy-momentum tensor vanishes. If the charged 
particles forming the cloud do not interact except via the electromagnetic field, i.e. 
the cloud is incoherent, T (J is given by equation (21.16). If, however, the particles 
constitute an ionized fluid, equations (22.19) or (22.21) must be used to calculate 
T 

* iy 

It was shown in section 21 that T,Jic equals the density of the x , -component of 
the linear momentum of a system. Since S„Jic = SJc 2 , the density of the linear 



82 


momentum of an electromagnetic field is g = S/c 2 , where S is Poynting’s vector. 
Alternatively, as explained in section 21, g can be interpreted as the current 
density vector for the inertial mass flow and thus,c 2 g = S gives the rate of energy 
flow across unit area placed perpendicular to the direction of this flow; this is the 
usual significance attached to Poynting’s vector. 

According to the theory in section 21, - S 44 /c 2 should equal the density of 
inertial mass for the field. We have found that -S 44 /c 2 = U/c 1 and, since U is the 
energy density, our result is in conformity with expectations. 

The results which have been obtained may be summarized as follows: If 
momentum of density S/c 2 and energy of density U are ascribed to the 
electromagnetic field, equation (29.19) shows that the net momentum and energy 
of the field and charge will be conserved. 


Exercises 4 

1. Write down the special Lorentz transformation equations for J and deduce 
the transformation equations for j, p, viz. 

Tx = (1 -u 2 /c 2 r' 2 (; x -pu), J y = j y 
p = (l -u'/c 2 )-' 2 (p-j x u/c 2 ), J=j , 

2. Deduce from the Maxwell equation F {j , = p 0 J, that div J = 0. 

3. Verify that the field tensor defined in terms of the 4-potential by equation 
(26.13) satisfies Maxwell’s equations (26.11) provided Q i satisfies the equations 
(26.12). 

4. (i) Prove that 

F ijF ij = 2n 0 (n 0 H 2 —c 0 E 2 ) 

and deduce that p 0 W 2 -«o£ 2 is invariant with respect to Lorentz 
transformations. 

(ii) Prove that 

^iiu F n F u= — 8i E • B/c 

and deduce that E B is an invariant density with respect to Lorentz 
transformations. 

5. If U istheenergy density and Sisthe Poynting vector for an electromagnetic 
field, prove that U 2 - S 2 is invariant. 

6. An observer O at rest in an inertial frame O xyzi finds himself to be in an 
electric field E = (0, £, 0), with no magnetic field. Show that an observer O 
moving according to O with uniform velocity V at right angles to E, finds electric 
and magnetic fields E, B connected by the relation 

c 2 B+Vx E = 0 

7. If Sij is the energy- momentum tensor for an electromagnetic field, prove 
that its trace, viz. Sa, is zero. 
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8. A plane monochromatic electromagnetic wave is being propagated in a 
direction parallel to the x-axis in the inertial frame S. Its electric and magnetic 
field components are given by 

E = [0, nsincoft — x/c), 0] 

B = [0, 0, -sina>(r — x/c)] 


Show that, when observed from the inertial frame S, it appears as the plane 
monochromatic wave 


where 


E = [0, AnsinAaiff- x/c), 0] 


B = [0, 0, A-sinA<w(t — x/c)] 


1 - u/c 
TTi 7/c 


u being the velocity of S relative to S (i.e. both the amplitude and frequency are 
reduced by a factor /. The reduction in frequency is the Doppler effect.) 

9. Show that the Hamiltonian for the motion of a particle with charge e and 
mass m in an electromagnetic field (A, ff) is 


H = c 




+ m 2 c 2 


1/2 


+ e<t) 


(Hint: Show that Hamilton’s equations yield the equation 


d 

--(m») = e(E + v x B).) 
di 


10. Verify that, in a region devoid of charge, equations (26.12) are satisfied by 

52 , = A i e‘ k ' Xl ’ 

provided A;, k p are constants such that 

= 0 , k^„ = 0 

By considering the 4-vector property of S2 ( , deduce that A { must transform as a 4- 
vector under Lorentz transformations. Deduce also that kpX p is a scalar under 
such transformations and hence that k p is a 4-vector, 

A plane electromagnetic wave, whose direction of propagation is parallel to the 
plane Oxy and makes an angle a with Ox, is given by 

Q _ '£2ffiv(xcosa + ysma - ci|/c 

where v is the frequency. The same wave observed from a parallel frame Oxyz 
moving with velocity u along Ox, has frequency v and direction of propagation 
making an angle 5 with Ox. By writing down the transformation equations for the 



vector k p , prove that 


u 

cosot — 
c 


u 

1 — cosa 
c 

1 1. Oxyz is an inertial frame S. A particle having rest mass m 0 and electric 
charge q moves in the xy-plane under the action of a uniform magnetic field B 
directed along the z-axis. Show that the particle's speed v is constant and that, 
with a suitable choice of coordinates, its trajectory is the circle 

x = R sin air, >• - Rcosan 

where 

c) = qB/(im 0 , R = v/o), j? = (1 — r 2 /c 2 )~ 1,2 

S is an inertial frame Oxpz parallel to S and O moves along Ox with speed u. 
Calculate u and B so that uniform fields E = (0, £ 0 , 0), B = (0, 0, B 0 ) are observed 
in S. Hence describe the motion of a charged particle released in these fields and 
show that its average velocity is E 0 /B 0 along the x-axis. 

1 2. The frame S is parallel to the frame S and is moving along the x-axis with 
speed u. In the frame S, there is a uniform electric field (0, £, 0) and a uniform 
magnetic field (0, 0, B). Show that it is possible to choose the value of u so that 
the field in the frame S is entirely magnetic and that its magnitude is then 

B 2 -£ 2 /c 2 ). What is the direction of this field? (Ans. Parallel to z-axis.) 

13. A charge q has rest mass m 0 and is moving in the positive sense along a 
negative x-axis with speed u, when it enters a magnetic field, having components 
B x = B y = 0, A, = B (constant), confined to the region 0 ^ x ^ a. There is no 
field in the regions x < 0, x > a. Explain why the inertial mass of the charge 
remains constant during its motion through the field and show that its path is the 
circle 

x 2 + y 2 + 2ky = 0, z = 0 


1 — cos x 
c 


cosx = 


where k = m 0 u/qB{ 1 -u 2 /c 2 ) w2 . What is the condition that the charge will be 
turned back by the field? (Ans. k < a.) 

14. A plane electromagnetic wave of frequency / is being propagated in a 
direction making an angle 6 with the x-axis. Its electric and magnetic field 
components are given by 


where 


E = ( - A Xs'\n8, A Xcosd, 0) 
B = (0, 0, AX/c) 


xcosO -t-ysinfl 
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Show that, when observed from a frame S which is parallel to Oxyz and moves 
with a velocity (t cosft, 0, 0) relative to Oxyz, the wave has components 

E = ( — /IXsinft, 0, 0) 

B = (0, 0, A X sinW/V) 

where 

X = sin[2n/sinfl(/ - — y/c)] 

and (x, y, z, t ) are space- time coordinates in S. What is the direction of 
propagation in S and what is the observed frequency? (Ans. Parallel to y-axis at 
frequency / sind.) 

1 5. V { is the 4-velocity of flow of a conducting medium and J, is the 4-current 
density of a charge flow in the medium. Ohm’s law is valid for the medium, a being 
its conductivity. Prove that 

{Him: Verify this equation in a frame for which the medium is at rest using Ohm’s 
law j = <tE.) 

16. A uniform magnetic field of induction B is directed along the z-axis of an 
inertial frame. Show that the energy momentum tensor for the field has 
components which are the elements of a 4 x 4 diagonal matrix, with diagonal 
elements B 2 / 2/4 0 (l, 1, - 1, - I). 

17. A point chargee moves along the z-axis of an inertial frames with constant 
velocity v. Calculate the electromagnetic field in a parallel inertial frame whose 
origin moves with the charge and deduce the field in S. Hence show that, at the 
instant t = 0, when the charge passes through the origin O of S, the electric field is 
directed radially from O and its magnitude at the point having spherical polar 
coordinates (r, 0 , 4>) is given by 

E = — — 7 (1 — 1 ' 2 /< 2 ) ( 1 -t; 2 sin 2 0/c 2 r 3,2 

4nf. 0 r 2 

Show, also, that the magnetic field in S at this instant is given by B = (v x E)/c 2 . 



CHAPTER 5 

General Tensor Calculus. Riemannian Space 


30. Generalized N-dimensional spaces 

In Chapter 2 the theory of tensors was developed in an N-dimensional Euclidean 
space on the understanding that the coordinate frame being employed was always 
rectangular Cartesian. If x„ x,+dxi are the coordinates of two neighbouring 
points relative to such a frame, the ‘distance’ ds between them is given by the 
equation 

ds 2 = dx,dxj (30.1) 

If x„ Xj + dx, are the coordinates of the same points with respect to another 
rectangular Cartesian frame, then 

ds 2 = dx,dXi (30.2) 

and it follows that the expression dx,dx, is invariant with respect to a 
transformation of coordinates from one rectangular Cartesian frame to another. 
Such a transformation was termed orthogonal. 

Now, even in S 3 , it is very often convenient to employ a coordinate frame which 
is not Cartesian. For example, spherical polar coordinates ( r , 0 , f>) are frequently 
introduced, these being related to rectangular Cartesian coordinates (x, y, z) by 
the equations 

x = rsin#cos0, y = rsinSsin^), z = rcosd (30.3) 

In such coordinates, the expression for ds 2 will be found to be 
ds 2 = dx 2 +dy 2 + dz 2 

= dr 2 +r 2 d0 2 + r 2 sin 2 0d<p 2 (30.4) 

and this is no longer of the simple form of equation (30.1). The coordinate 
transformation (30.3) is accordingly not orthogonal. In fact, it is not even linear, 
as was the most general coordinate transformation (8. 1 ) considered in Chapter 2. 

The spherical polar coordinate system is an example of a curvilinear coordinate 
frame in <? 3 . Let (u, v, w) be quantities related to rectangular Cartesian coordinates 
(x, y, z) by equations 

u = u(x, y, z), v = e(x, y, z), w = w(x, >•, z) (30.5) 

86 



87 


such that, to each point there corresponds a unique triad of values of (u, v, w)and 
to each such triad there corresponds a unique point. Then a set of values of 
(u, i, w) will serve to identify a point in <? 3 and (u, t, w) can be employed as 
coordinates. Such generalized coordinates are called curvilinear coordinates. 

The equation 

u|x, v, z) = u 0 (30.6) 

where u n is some constant, defines a surface in over which u takes the constant 
value u 0 . Similarly, the equations 

v = i 0 , w = u 0 (30.7) 

define a pair of surfaces on which v takes the value c 0 and w the value w 0 
respectively. These three surfaces will all pass through the point P 0 having 
coordinates (u 0 , r 0 , vv 0 )as shown in Fig. 6. They are called the coordinate surjaces 
through P 0 . The surfaces v = i 0 , w = w 0 will intersect in a curve P 0 U along which 
r and w will be constant in value and only u will vary P 0 U is a coordinate line 
through P 0 . Altogether, three coordinate lines pass through P 0 . The equations u 
= constant, i = constant, w = constant define three families of coordinate sur- 
faces corresponding to the three families of planes parallel to the coordinate 
planes x = 0, y = 0, z = 0 of a rectangular Cartesian frame. Pairs of these 
surfaces intersect in coordinate lines which correspond to the parallels to the 
coordinate axes in a Cartesian frame. 


w 



Solving equations (30.5) for (x, >, z) in terms of (u, t, w), we obtain the inverse 
transformation 


x = x(u, r, w), y = y(u, r, vv), z=z(u,v,w ) (30.8) 

Let-fx, y, z), (x + dx, y + dy, z + dz) be the rectangular Cartesian coordinates of 
two neighbouring points and let (u, v, w), (u + du, v + dr, w -t- dw) be their 
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respective curvilinear coordinates. Differentiating equations (30.8), we obtain 

dx = — du +^-dr + — dw, etc. (30.9) 

c u ci cw 

Thus, if ds is the distance between these points, 
d.s 2 = dx 2 + dy 2 + dr 2 

= A du 2 + Bdv 2 + Cdw 2 + 2Fdrdw + 2Gdwdu + 2Hdudv (30.10) 

giving the appropriate expression for ds 2 in curvilinear coordinates. It will be 
noted that the coefficients A, B , etc., are, in general, functions of (u, c, vr). 

If, therefore, curvilinear coordinate frames are to be permitted, the theory of 
tensors developed in Chapter 2 must be modified to make it independent of the 
special orthogonal transformations for which ds 2 is always expressible in the 
simple form of equation (30.1). The necessary modifications will be described in 
the later sections of this chapter. However, these modifications prove to be of such 
a nature that the amended theory makes no appeal to the special metrical 
properties of Euclidean space, i.e the theory proves to be applicable in more 
general spaces for which Euclidean space is a particular case. This we shall now 
explain further. 

Let (x\x 2 , . . . , x s ) be curvilinear coordinates in £ N .* Then, by analogy with 
equation (30.10), if ds is the distance between two neighbouring points, it can be 
shown that 

ds 2 = g^dx'dx' (30.11) 

where the coefficients g tj of the quadratic form in the x j will, in general, be 
functions of these coordinates. Since the space is Euclidean, it is possible to 
transform from the curvilinear coordinates x' to Cartesian coordinates y' so that 

ds 2 = dy'dy' (30.12) 

Clearly, the reduction of ds 2 to this simple form is only possible because the 
functions g tj satisfy certain conditions. Conversely, the satisfaction of these 
conditions by the g tJ will guarantee that coordinates y 1 exist for which ds 2 takes 
the simple form (30.12) and hence that the space is Euclidean. However, in 
extending the theory of tensors to be applicable to curvilinear coordinate frames, 
we shall, at a certain stage, make use of the fact that ds 2 is expressible in the form 
(30.1 1), but no use will be made of the conditions satisfied by the coefficients g tJ 
which are a consequence of the space being Euclidean. It follows that the 
extended theory will be applicable in a hypothetical /V-dimensional space for 
which the 'distance' ds between neighbouring points x‘, x' +dx' is given by an 
equation (30. 1 1 ) in which the g, t are arbitrary functions of the x'.t Such a space is 

* The coordinates are here distinguished by superscripts instead of subscripts for a reason which will 
be given later. 

t Except that partial derivatives of the will be assumed to exist and to be continuous to any order 
required by the theory. 
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said to be Riemannim and will be denoted by & N . S N is a particular StN for which 
the g it satisfy certain conditions. The right-hand member of equation (30.1 1) is 
termed the metric of the Riemannian space. 

The surface of the Earth provides an example of an & 2 . If 9 is the co-latitude 
and 4> is the longitude of any point on the Earth’s surface, the distance ds between 
the points ( 0 , 4 > ), ( 0 + d0, <f> + d<£) is given by 

ds 2 = R 2 (d0 2 + sin 2 6d<t> 2 ) (30.13) 

where R is the earth’s radius. For this space and coordinate frame, the g :j take the 
form 

0u=£ 2 , 0 h = 9n=0, g 22 = R 2 sin 2 0 (30.14) 

It is not possible to define other coordinates (x, y) in terms of which 

ds 2 = dx 2 + dy 2 (30.15) 

over the whole surface, i.e. this &t 2 is not Euclidean. However, the surfaces of a 
right circular cylinder and cone are Euclidean; the proof is left as an exercise for 
the reader. 

It will be proved in Chapter 6 that, in the presence of a gravitational field, 
space-time ceases to be Euclidean in Minkowski’s sense and becomes an 9i x . This 
is our chief reason for considering such spaces. However, we can generalize the 
concept of the space in which our tensors are to be defined yet further. Until 
section 37 is reached, we shall make no further reference to the metric. This 
implies that the theory of tensors, as developed thus far, is applicable in a very 
general N-dimensional space in which it is assumed it is possible to set up a 
coordinate frame but which is not assumed to possess a metric. In such a 
hypothetical space, the distance between two points is not even defined. It will be 
referred to as ,9' N . 9i N is a particular N for which a metric is specified. 

31. Contravariant and covariant tensors 

Let x 1 be the coordinates of a point P in , < f N relative to a coordinate frame which is 
specified in some manner which does not concern us here. Let x 1 be the 
coordinates of the same point with respect to another reference frame and let 
these two systems of coordinates be related by equations 

x‘ = x‘(x', x 2 , . . . , x N ) (31.1) 

Consider the neighbouring point P' having coordinates x‘ + dx' in the first frame. 
Its coordinates in the second frame will be x' + dx', where 

dx' = ~- i dx J (31.2) 

ex' 

and summation with respect to the index j is understood. The N quantities dx' are 
taken to be the components of the displacement vector PP' referred to the first 
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frame. The components of this vector referred to the second frame are, 
correspondingly, the dx' and these are related to the components in the first frame 
by the transformation equation (31.2). Such a displacement vector is taken to be 
the prototype for all comravariani vectors. 

Thus, A‘ are said to be the components of a contra variant vector located at the 
point x‘, if the components of the vector in the 'barred’ frame are given by the 
equation 

= 1' (31-3) 

cx ’ 

It is important to observe that, whereas in Chapter 2 the coefficients a tJ occurring 
in the transformation equation (10.2) were not functions of the Cartesian 
coordinates x, so that the vector A was not, necessarily, located at a definite point 
in S s , the coefficients cx'/cx‘ in the corresponding equation (31.3) are functions 
of the x 1 and the precise location of the vector A' must be known before its 
transformation equations are determinate. This can be expressed by saying that 
there are no free vectors in ■9’ s . 

The form of the transformation equation (31.3) should be studied carefully. It 
will be observed that the dummy index j occurs once as a superscript and once as a 
subscript (i.e. in the denominator of the partial derivative). Dummy indices will 
invariably occupy such positions in all expressions with which we shall be 
concerned. Again, the free index i occurs as a superscript on both sides of the 
equation. This rule will be followed in all later developments, i.e. a free index will 
always occur in the same position (upper or lower) in each term of an equation. 
Finally, it will assist the reader to memorize this transformation if he notes that 
the free index is associated with the ’barred’ symbol on both sides of the equation. 

A contra variant vector A' may be defined at one point ofy v only. However, if it 
is defined at every point of a certain region, so that the A' are functions of the x‘, a 
comravariani vector field is said to exist in the region. 

If V is a quantity which is unaltered in value when the reference frame is 
changed, it is said to be a scalar or an invariant in V v Its transformation equation 
is simply 

V = V (31.4) 

Since this equation involves no coefficients dependent upon the x 1 , the possibility 
that V may be a free invariant exists. However, Fis more often associated with a 
specific point in V v and may be defined at all points of a region of y v , in which 
case it defines an invariant field. In the latter case 

F = F(x‘, x 2 x v | (31.5) 

V will then, in general, be a quite distinct function of the x'. If, however, in this 
function we substitute for the x‘ in terms of the x‘ from equation (31.1). by 
equation (31.4) the right-hand member of equation (31.5) must result. Thus 

F(x',x 2 x s ) = F(x‘,x 2 x v ) 


(31.6) 
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V being an invariant field, consider the N derivatives c V/cx‘. In the x'-frame, 
the corresponding quantities are cV/cx‘ and we have 

dV cV dx 1 dx 1 cV 

= =r|7q = T ci7-j (31.7) 

ox ox J ox ex ex 1 

since, by equation (31.6), when V is expressed as a function of the x 1 it reduces to 
V. As in Chapter 2, the c V/cx‘ are taken to be the components of a vector called 
the gradient of Cand denoted by grad V. However, its transformation law (3 1.7) is 
not the same as that for a contravariant vector, viz. (31.3), and it is taken to be the 
prototype for another species of vectors called covariant vectors. 

Thus, B, is a covariant vector if 

B,=^B, (31.8) 

ex 


Covariant vectors will be distinguished from contravariant vectors by writing 
their components with subscripts instead of superscripts. This notation is 
appropriate, for cV/dx' is a covariant vector and the index i occurs in the 
denominator of this partial derivative. The vector dx', on the other hand, has been 
shown to be contravariant in its transformation properties and this is correctly 
indicated by the upper position of the index. This is the reason for denoting the 
coordinates by x‘ instead of x„ although it must be clearly understood that the x' 
alone are not the components of a vector at all. 

The reader should check that the three rules formulated above in relation to the 
transformation equation (31.3), apply equally to the equation (31.8). 

The generalization from vectors to tensors now proceeds along the same lines 
as in section 10. Thus, if A\ B‘ are two contravariant vectors, the N 2 quantities 
/4'B J are taken as the components of a contravariant tensor of the second rank. Its 
transformation equation is found to be 

('v 1 ( X j 

A'B> = — yi A k B‘ (31.9) 

ex cx 


Any set of N 2 quantities C ' 1 transforming in this way is a contravariant tensor. 

Again, if A‘, B j are vectors, the first contravariant and the second covariant, 
then the N 2 quantities A'B i transform thus: 


A'B, 


dx cx 1 


(31.10) 


Any set of N 2 quantities C) transforming in this fashion is a mixed tensor, i.e. it 
possesses both contravariant and covariant properties as is indicated by the two 
positions of its indices. 

Similarly, the transformation law for a covariant tensor of rank 2 can be 
assembled from the law for covariant vectors. 

The further generalization to tensors of higher rank should now bean obvious 
step. It will be sufficient to give one example. A‘ jk is a mixed tensor of rank 3, 
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having both the covariant and contravariant properties indicated by the positions 
of its indices, if it transforms according to the equation 


— . cx cx cx 

A — A' 

lk cx r cx 1 dx k sl 


(31.11) 


The components of a tensor can be given arbitrary values in any one frame and 
their values in any other frame are then uniquely determined by the transform- 
ation law. Consider the mixed second rank tensor whose components in the 
x'-frame are 6), the Kronecker deltas (<5‘- = 0, i f j and <5‘ = 1,1=7). The 
components in the x'-frame are <Tj, where 


dx‘ ex' .. 

— —.ti 

cx cx J 


dx‘ CX* 
cx* cx j 


ex' 

e¥ j 




(31.12) 


Thus this tensor has the same components in all frames and is called the 
fundamental mixed tensor. However, a second rank covariant tensor whose 
components in the x'-frame are the Kronecker deltas (in this case denoted by <5,j), 
has different components in other frames and is accordingly of no special interest. 

It is reasonable to enquire at this stage why the distinction between covariant 
and contravariant tensors did not arise when the coordinate transformations 
were restricted to be orthogonal. Thus, suppose that A \ B, are contravariant and 
co variant vectors with respect to the orthogonal transformation (8. 1 ). The inverse 
transformation has been shown to be equation (1 1.5) and it follows from these 
two equations that 


cx, cx, 

y- = ~ 

CX, CX; 


(31.13) 


For the particular case of orthogonal transformations, therefore, equations (31.3) 
and (31.8) take the form 


A‘ = a,jA\ B, = a iJ B 1 (31.14) 

It is clear that both types of vector transform in an identical manner and the 
distinction between them cannot, therefore, be maintained. 

As in the case of the Cartesian tensors of Chapter 2, new tensors may be formed 
from known tensors by addition (or subtraction) and multiplication. Only 
tensors of the same rank and type may be added to yield new tensors. Thus, if A‘ Jk , 
B) k are components of tensors and we define the quantities C‘ Jk by the equation 

C) k = A‘ jk + B) k (31.15) 
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then C) k are the components of a tensor having the covariant and contravariant 
properties indicated by the position of its indices. However, A), B h cannot be 
added in this way to yield a tensor. Any two tensors may be multiplied to yield a 
new tensor. Thus, if A‘ p are tensors and we define N $ quantities C by the 
equation 

Cj L = A)B k lm (31.16) 

these are the components of a fifth rank tensor having the covariant and 
contravariant properties indicated by its indices. The proofs of these statements 
are left for the reader to provide. 

If a tensor is symmetric (or skew-symmetric) with respect to two of its 
superscripts or to two of its subscripts in any one frame, then it possesses this 
property in every frame. The method of proof is identical with that of the 
corresponding statement for Cartesian tensors given in section 10. However, if 
Aj = Aj is true for all i,j when one reference frame is being employed, this 
equation will not, in general, be valid in any other frame. Thus, symmetry (or 
skew-symmetry) of a tensor with respect to a superscript and a subscript is not, in 
general, a covariant property. The tensor <5‘ is exceptional in this respect. 

Another result of great importance which may be established by the same 
argument we employed in the particular case of Cartesian tensors, is that an 
equation between tensors of the same type and rank is valid in all frames if it is 
valid in one. This implies that such tensor equations are covariant (i.e. are of in- 
variable form) with respect to transformations between reference frames. The useful- 
ness of tensors for our later work will be found to depend chiefly upon this property. 

A symbol such as A‘ jk can be contracted by setting a superscript and a subscript 
to be the same letter. Thus A A‘ ik are the possible contractions of A' Jk and each, by 
the repeated index summation convention, represents a sum. Since in the symbol 
Aj p j alone is a free index, this entity has only N components. Similarly A' lk has N 
components. It will now be proved that, if A) k is a tensor, its contractions are also 
tensors. Specifically, we shall prove that Bj = A ) f is a covariant vector. For 

_ _. ex' ex* ex' . 

B i = = 

cx ex 1 ex 


_ (ex' cx‘\cx s 
v ex' cx r ldx J sl 


= — - A’ 
ex’ ^x' 51 

- 


CX , 

= AL 


(31.17) 
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establishing the result. This argument can obviously be generalized to yield the 
result that any contracted tensor is itself a tensor of rank two less than the tensor 
from which it has been derived and of the type indicated by the positions of its 
remaining free indices. In this connection it should be noted that, if A‘ jk is a tensor, 
A'jj is not, in general, a tensor; it is essential that the contraction be with respect to 
a superscript and a subscript and not with respect to two indices of the same kind. 

If A) k , are tensors, the tensor A‘ jk B' is called the outer product of these two 
tensors. If this product is now contracted with respect to a superscript of one 
factor and a subscript of the other, e.g. A‘ jk B', the result is a tensor called an inner 
product. 


32. The quotient theorem. Conjugate tensors 

It has been remarked in the previous section that both the outer and inner 
products of two tensors are themselves tensors. Suppose, however, that it is 
known that a product of two factors is a tensor and that one of the factors is a 
tensor, can it be concluded that the other factor is also a tensor? We shall prove 
the following quotient theorem: 

If the result of taking the product ( outer or inner ) of a given set of elements with a 
tensor of any specified type and arbitrary components is known to be a tensor, then 
the given elements are the components of a tensor. 

It will be sufficient to prove the theorem true for a particular case, since the 
argument will easily be seen to be of general application. Thus, suppose the A ‘ jk are 
N 3 quantities and it is to be established that these are the components of a tensor 
of the type indicated by the positions of the indices. Let B r s be a mixed tensor of 
rank 2 whose components can be chosen arbitrarily (in any one frame only of 
course) and suppose it is given that the inner product 

A‘,1% = C‘ s (32.1) 

is a tensor for all such B\. All components have been assumed calculated with 
respect to the x-frame. Transforming to the x-frame, the inner product is given to 
transform as a tensor and hence we have 

A$B)=C i p (32.2) 

where A)* are the actual components replacing the A‘ jk when the reference frame is 
changed. Let A ' Jk be a set of elements defined in the x-frame by equation (31.11). 
Since this is a tensor transformation equation, we know that the elements so 
defined will satisfy 

A‘ k B ; = C'i (32.3) 

Subtracting equation (32.3) from (32.2), we obtain 

[A‘* k - A‘ k )B k , = 0 (32.4) 

Since B^ has arbitrary components in the x-frame, its components in the x-frame 
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are also arbitrary and the components Bj can assume any convenient values. 
Thus, taking Bj = 1 when k = K and B* = 0 otherwise, equation (32.4) yields 

a'jk — a ; K = 0 

or Ajk — A' )K (32.5) 

This being true for K = 1,2 iV, we have quite generally 

A)t = A‘ Jk (32.6) 

This implies that A) k does transform as a tensor. 

We will first give a very simple example of the application of this theorem. Let 
A‘ be an arbitrary contravariant vector. Then 

d)A 1 = A‘ (32.7) 

and since the right-hand member of this equation is certainly a vector, by the 
quotient theorem Sj is a tensor (as we have proved earlier). 

Asa second example, let g tJ be a symmetric covariant tensor and let cj = |r/, v | be 
the determinant whose elements are the tensor’s components. We shall denote 'by 
G‘ J the co-factor in this determinant of the element g tj . Then, although G‘ J is not a 
tensor, if g =* 0, G iJ /g = g‘ J is a symmetric contravariant tensor. To prove this, we 
first observe that 

= y%, t),jG ik = y&i ( 32 . 8 ) 

and hence, dividing by g. 

= <3f. 132.9) 

Now let A 1 be an arbitrary contravariant vector and define the covariant vector B, 
by the equation 

B, = y lk A k (32.10) 

Since g » 0, when the components of B, are chosen arbitrarily, the corresponding 
components of A' can always be calculated from this last equation, i.e. B, is 
arbitrary with A'. But 

g‘ J B, — y' J g lk A k — 6 J k A k — A 1 (32.11) 

having employed the second identity (32.9). It now follows by the quotient 
theorem that g' J is a contravariant tensor. That it is symmetric follows from the 
circumstance that G‘ J possesses this property. g, r g‘ J are said to be conjugate to 
one another. 

33. Covariant derivatives. Parallel displacement. Affine connection 

In the earlier sections of this chapter, the algebra of tensors was established and it 
is now time to explain how the concepts of analysis can be introduced into the 
theory. Our space .9%- has N dimensions, but is otherwise almost devoid of special 
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characteristics. Nonetheless, it has so far been able to provide all the facilities 
required of a stage upon which the tensors are to play their roles. It will now be 
demonstrated, however, that additional features must be built into the structure 
of ,9V before it can function as a suitable environment for the operations of 
tensor analysis. 

It has been proved that, if cp is an invariant field, d<p/dx‘ is a covariant vector. 
But, if a covariant vector is differentiated, the result is not a tensor. For, let A { be 
such a vector, so that 

cx 11 

(33.1) 

ex' 

Differentiating both sides of this equation with respect to x\ we obtain 

dA t _ 8x k dx‘ cA k 8 2 x k 
£x J dx' dx j dx 1 dx'dx J k 

The presence of the second term of the right-hand member of this equation 
reveals that dAJcx 1 does not transform as a tensor. However, this fact can be 
arrived at in a more revealing manner as follows: 

Let P, P' be the neighbouring points x‘, x'+dx' and let A t , /1,+d/l, be the 
vectors of a covariant vector field associated with these points respectively. The 
transformation laws for these two vectors will be different, since the coefficients of 
a tensor transformation law vary from point to point in Sf N . It follows that the 
difference of these two vectors, namely d/1,, is not a vector. However, 

£A 

d/f,=^-!dx j (33.3) 

vx J 

and, since dx J is a vector, if A itj were a tensor, d A t would be a vector, /(^ cannot 
be a tensor, therefore. The source of the difficulty is now apparent. To define /l, j 
it is necessary to compare the values assumed by the vector field A t at two 
neighbouring, but distinct, points and such a comparison cannot lead to a tensor. 
If, however, this procedure could be replaced by another, involving the 
comparison of two vectors defined at the same point, the modified equation (33.3) 
would be expected to be a tensor equation featuring a new form of derivative 
which is a tensor. This leads us quite naturally to the concept of parallel 
displacement. 

Suppose that the vector A, is displaced from the point P, at which it is defined, 
to the neighbouring point P’, without change in magnitude or direction, so that it 
may be thought of as being the same vector now defined at the neighbouring 
point. The phrase in italics has no precise meaning in 6f s as yet, for we have not 
defined the magnitude or the direction of a vector in this space. However, in the 
particular case when if N is Euclidean and rectangular axes are being employed, 
this phrase is, of course, interpreted as requiring that the displaced vector shall 
possess the same components as the original vector. But even in S N , if curvilinear 
coordinates are being used, the directions of the curvilinear axes at the point P' 
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will, in general, be different from their directions at P and, as a consequence, the 
components of the displaced vector will not be identical with its components 
before the displacement. In y v , therefore, components of the displaced vector 
will be denoted by A i + dA,. This vector can now be compared with the field vector 
Aj +d/4, at the same point P'. Since the two vectors are defined at the same point, 
their difference is a vector at this point, i.e. dA { - <5,4,, is a vector. The modified 
equation (33.3) is accordingly expected to be of the form 

dA,-dA, = A r] d.x 1 (33.4) 

where A,- tJ is the appropriate replacement for A, . Since dx J is an arbitrary vector 
and the left-hand member of equation (33.4) is known to be a vector, A lf will, by 
the quotient theorem, be a covariant tensor. It will be termed the covariant 
derivative of A t . Thus, the problem of defining a tensor derivative has been re- 
expressed as the problem of defining parallel displacement (infinitesimal) of a 
vector. 

We are at liberty to define the parallel displacement of A, from P to P' in any 
way we shall find convenient. However, to avoid confusion, it is necessary that the 
definition we accept shall be in conformity with that adopted in <? v , which is a 
special case of y v . Suppose, therefore, that our ,9’s is Euclidean and that y' are 
rectangular Cartesian coordinates in this space. Let B, be the components of the 
vector field A { with respect to these rectangular axes. Then 


A,= 




(33.5) 


If the parallel displacement of the vector A, to the point P' is now carried out, its 
Cartesian components B i will not change, i.e. <5B, = 0. Hence, from the first of 
equations (33.5), we obtain 





dx*B, 


(33.6) 


Substituting for Bj into this equation from the second of equations (33.5), we find 


that 

SA t 

: = TUdx‘ 

(33.7) 

where 

r l 

1 tk 

c 2 y J ex' 

1 ^ j 

rxrx cr 

(33.8) 


This shows that, in <? v , the <5.4, are bilinear forms in the A, and dx*. In// v , we shall 
accordingly define the 6A t by the equation (33.7), determining the N 3 quantities 
Tin arbitrarily at every point of if s * This set of quantities T| t is called an affinity 


* Subject to the requirement that the r' (k are continuous functions ol the a' and possess continuous 
partial derivatives to the order necessary to validate all later arguments. 
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and specifies an affine connection between the points of .9' s . A space which is 
affinely connected possesses sufficient structure to permit the operations of tensor 
analysis to be carried out within it. 

For we can now write 

d/1, - <M, = C ^'dx J - T‘ Affix- 1 

CX J 



(33.9) 


But, as we have already explained, the left-hand member of this equation is a 
vector for arbitrary dx J and hence it follows that 


A 


i -i 


c_A, 

ex* 


(33.10) 


is a covariant tensor, the covariant derivative of A,. 

It will be observed from equation (33.10) that, if the components of the affinity 
all vanish over some region of .‘/ N , the covariant and partial derivatives are 
identical over this region. However, this will only be the case in the particular 
reference frame being employed, in any other frame the components of the 
affinity will, in general, be non-zero and the distinction between the two 
derivatives will be maintained. In tensor equations which are to be valid in every 
frame, therefore, only covariant derivatives may appear, even if it is possible to 
find a frame relative to which the affinity vanishes. 

We have stated earlier that, when defining an affine connection, the com- 
ponents of an affinity may be chosen arbitrarily. To be precise, a coordinate frame 
must first be selected in ,9^ and the choice of the components of the affinity is then 
arbitrary within this frame. However, when these have been determined, the 
components of the affinity with respect to any other frame are, as for tensors, 
completely fixed by a transformation law. We now proceed to obtain this 
transformation law for affinities. 


34. Transformation of an affinity 


The manner in which each of the quantities occurring in equation (33.10) 
transforms is known, with the exception of the affinity Tjj. The transformation 
law for this affinity can accordingly be deduced by transformation of this 
equation. Relative to the x-frame, the equation is written 


A ' 

<J vx J 


r*A 


(34.1) 


_ dx r 
A i = —A r 


Since /!„ A i:j are tensors. 


(34.2) 
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cx s ex' 

: / = ^ i - - , A , ; 

CX CX 1 


obtain 

cA, c 2 x’ 

- 1 Sx' 

_ — ' + - . A t 

- r* — 

: J cx cx cx J 

II ^ — L 

CX 


(34.3) 


(34.4) 


Employing equation (33.10) to substitute for .4 and cancelling a pair of identical 
terms from the two members of equation (34.4), this equation reduces to 


ex’ ex' c 2 x r _. (V 

~ T' a A, = J-^A, - 
ex cx J cx cx J cx k 


(34.5) 


Since A, is an arbitrary vector, we can equate coefficients of A r from the two 
members of this equation to obtain 


— k cx' _ cx s ex' d 2 x r 
,J cx lc dx‘ cx J s, + Cx'cx' 


(34.6) 


Multiplying both sides of this equation by dx’/cx' and using the result 


ex’ 

ex' 

('x‘ 

^ — 


« — L 

( x r 

( X 

cx 


(34.7) 


yields finally 


*» — I ^ t » •> — / ** 2 r 

CX CX CX cx C X 

' , — - , " — i SI "E s , , — I *> — , 

cx' ex' < x J cx cx'cx’ 


(34.8) 


which is the transformation law for an affinity. 

It should be noted that, were it not for the presence of the second term in the 
right-hand member of equation (34.8), T* would transform as a tensor of the 
third rank having the covariant and contravariant characteristics suggested 
by the positions of its indices. Thus, the transformation law is linear in the 
components of an affinity but is not homogeneous like a tensor transformation 
law. This has the consequence that, if all the components of an affinity are zero 
relative to one frame, they are not necessarily zero relative to another frame. 
However, in general, there will be no frame in which the components of an affinity 
vanish over a region of .‘f s , though it will be proved that, provided the affinity is 
symmetric, it is always possible to find a frame in which the components all vanish 
at some particular point (see section 39). 

Suppose T‘, Tf* are two affinities defined over a region of .’f N . Writing down 
their transformation laws and subtracting one from the other, it is immediate that 


pit _ P** 

1 ‘j 1 ij 


cx k cx s 
ex' ex' 


% (H, - r*) 

r y j 


(34.9) 


i.e. the difference of two affinities is a tensor. However, the sum of two affinities is 
neither a tensor nor an affinity. It is left as an exercise for the reader to show. 
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similarly, that the sum of an affinity T^ and a tensor A * is an affinity. 

If r‘j is symmetric with respect to its subscripts in one frame, it is symmetric in 
every frame. For, from equation (34.8), 

_ dx k dx s ex' r dx k d 2 x r 
J ‘ ex' dx J ex’ + dx r cx J dx‘ 

_ dx k dx‘ dx s , dx k d 2 x r 
dx r dx 1 ox 1 ,s + ex' dx’dx J 
= (3 4 -10) 

where, at the first step, we have put rj s = T',. 

35. Covariant derivatives of tensors 

In this section, we shall extend the process of covariant differentiation to tensors 
of all ranks and types. 

Consider first an invariant field V. When V suffers parallel displacement from P 
to P', its value will be taken to be unaltered, i.e. h V = 0 in all frames. Hence 

1/ 

dV — SV = °— i dx‘ (35.1) 

ex' 

is the counterpart for an invariant of equation (33.4). It follows that 

K.= Ki < 35 - 2 ) 

i.e. the covariant derivative of an invariant is identical with its partial derivative or 
gradient. 

Now let B‘ be a contravariant vector field and A, an arbitrary covariant vector. 
Then A { B‘ is an invariant and, when parallel displaced from P to P', remains 
unchanged in value. Thus 

<5(4,B‘) = 0 

or <5/1,8' + AjdB' = 0 

and hence, by equation (33.7), 

A k dB k = —r k j A k dx J B‘ (35.3) 

But, since the A k are arbitrary, their coefficients in the two members of this 
equation can be equated to yield 

dB k =-r i )B i dx J (35.4) 

This equation defines the parallel displacement of a contravariant vector. The 
covariant derivative of the vector is now deduced as before: thus 

dB‘-<5B‘ = (~ + r i ‘B , W 


(35.5) 
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and since d.v J is an arbitrary vector and dB k - SB 1 is then known to be a vector, 

~ gA. 

B k = - + r (35.6) 

( x J 

is a tensor called the covariant derivative of B k . 

Similarly, if A) is a tensor field, we consider the parallel displacement of the 
invariant where B,.C J arc arbitrary vectors. Then, from 

SiA'jBtC*) = 0 (35.7) 

and equations (33.7) and (35.4), we deduce that 

6 A ■ = T^/lidx* - r, 4 ' /4‘ d.v 1 (35.8) 

It now follows that 

= + (35.9) 

is the covariant derivative required. 

The rule for finding the covariant derivative of any tensor will now be plain 
from examination of equation (35.9), viz., the appropriate partial derivative is first 
written down and this is then followed by ‘affinity terms'; the 'affinity terms' are 
obtained by writing down an inner product of the affinity and the tensor with 
respect to each of its indices in turn, prefixing a positive sign when the index is 
contravariant and a negative sign when it is covariant. 

Applying this rule to the tensor field whose components at every point are those 
of the fundamental tensor it will be found that 

s;.k = w - 1 v; = r * - n* = 0 (35. mi 

Thus, the fundamental tensor behaves like a constant in covariant differentiation. 

Finally, in this section, we shall demonstrate that the ordinary rules for the 
differentiation of sums and products apply to the process of covariant 
differentiation. 

The right-hand member of equation (35.9) being linear in the tensor A), it 
follows immediately that if 



C' = A\ + B) 

(35.11) 

then 

( ;\» = A ).k + B' l k 

(35.12) 

Now suppose that 

C = A)B J 

(35.13) 

Then 

c 

fC' _ 



= J-(A-B’) + r; k A'B’ 

(X ’ 



Then 
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cA' \ fi'B 1 

i + r; t a] - r;» -4; )B j + — j- + r/ t s r U; 


= Aj k B J + B j k Aj 

which is the ordinary rule for the differentiation of a product. 


(35.14) 


36. The Riemann-Christoffel curvature tensor 

If a rectangular Cartesian coordinate frame is chosen in a Euclidean space and 
if A 'are the components of a vector defined at a point Q with respect to this frame, 
then 6A‘ = 0 for an arbitrary small parallel displacement of the vector from Q. 
This being true for arbitrary A\ it follows from equation (35.4) that r‘ jk = 0 with 
respect to this frame at every point of S N . Suppose C is a closed curve passing 
through Q and that A‘ makes one complete circuit of C, being parallel displaced 
over each element of the path. Then its components remain unchanged 
throughout the motion and hence, if A‘ + AA' denotes the vector upon its return 
to Q, 

AA' = 0 (36.1) 

Since AA' is the difference between two vectors both defined at Q, it is itself a 
vector and equation (36.1) will therefore be a vector equation true in all frames. 
Thus, in <$ N , parallel displacement of a vector through one circuit of a closed curve 
leaves the vector unchanged. 

If, however, A' is defined at a point Q in an affinely connected space not 
necessarily Euclidean, it will no longer be possible, in general, to choose a 
coordinate frame for which the components of the affinity vanish at every point. 
As a consequence, if A' is parallel displaced around C, its components will vary 
and it is no longer permissible to suppose that upon its return to Q it will be 
unchanged, i.e. AA‘ ± 0. We shall now calculate A A' when A' is parallel displaced 
around a small circuit C enclosing the point P having coordinates x‘ (Fig. 7) at 
which it is initially defined. 



Fig. 7 
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Let U be any point on this curve and let x' + be its coordinates, the c,‘ being 
small quantities. V isa point on C near to (J and having coordinates x' + + d£'- 

When A‘ is displaced from U to V, its components undergo a change 


SA‘ = -T^/Td^ (36.2) 

where rj* and A 1 are to be computed at U. Considering the small displacement 
from P to U and employing Taylor’s theorem, the value of at U is seen to be 


cx 


(36.3) 


to the first order in the In this expression, the affinity and its derivative are to be 
computed at P. A 1 in equation (36.2) represents the vector after its parallel 
displacement from P to U, i.e. it is 


Ai-ri,A'? 


(36.4) 


where A 1 , A" and T/ ( are all to be calculated at the point P. To the first order in 4' 
therefore, equation (36.2) may be written 


SA‘ = - 


r; t A' + 


Ai ^jr- r * r fiA r )? 


d? 


Integrating around C, it will be found that 


(36.5) 


aa‘ = -r;^ j <J)dc;‘ + 


(r r 

Px* 


A J 4>e'd£* 


(36.6) 


where the dummy indices j, r have been interchanged in the final term of the right- 
hand member of equation (36.5). 


Now 



(36.7) 


Also 


so that 


d(£'£‘) = Afc'c*) = 0 



(36.8) 

(36.9) 


implying that the left-hand member of this equation is skew-symmetric in / and k. 
Since <f', d^ 1 are vectors, it is also a tensor. Denoting it by a“, we have 


a 


kl 



(c'dif k - £‘d<;') 


c 


(36.10) 
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and equation (36.6) then reduces to the form 

A/1' = j/lV (36.11) 

Apart from its property of skew-symmetry, a*' is arbitrary. Nonetheless, since it 
is not completely arbitrary, the quotient theorem (section 32) cannot be applied 
directly to deduce that the contents of the bracket in equation (36. 1 1 ) constitute a 
tensor. In fact, this expression is not a tensor. However, it is easy to prove that, if 
X‘ kl is skew-symmetric with respect to k, i and if V", defined by the equation 

r= X‘ kl a kl (36.12) 

is a vector for arbitrary skew-symmetric tensors a*' , then XJ,, is also a tensor. 

To prove this, let ft kl bean arbitrary symmetric tensor. Then the components of 
the tensor 


y u = a kl + p kl (36.13) 

are completely arbitrary, for, assuming k < /, 

y u = a kl + li u , y‘ k = - a"' + /?*' (36. 1 4) 

and it follows that the values of'/','/ 1 * can be chosen arbitrarily and then 

a*' = j(/' - -/*), /?“ = \(y u + >•'*) (36. 1 5) 

i.e., it is only necessary to fix the values of «*', /?*' in the cases k < I in order that the 
/' shall assume any specified values over the complete range of its superscripts, 
with the exception of the cases when two superscripts are equal. If the superscripts 
are equal, oc kl = 0 and /' = //*'. But these ft 11 are also arbitrary and hence so again 
are the /' with equal superscripts. 

Since /?*' is symmetric and X‘ k , is skew-symmetric, 

X i kl p k '=0 (36.16) 

Adding equations (36.12) and (36.16), we obtain therefore 


*1, '/' = Y‘ (36.17) 

But /' is an arbitrary tensor and hence, by the quotient theorem, X‘ kt is a tensor. 

The multiplier of a*' in equation (36. 1 1 ) is not skew-symmetric ink,/. However, 
it can be made so as follows: Interchange the dummy indices k, / in this equation 
to obtain 

AA'^jV^-^/lV* (36.18) 


Adding equations (36.1 1) and (36.18) and noting that a*' = - x“, it will be found 
that 


a/i‘ = d n*r', - n, r't +~^~ ?£ ) A>* kl 

\ 1 vx k ox 1 / 


(36.19) 
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The bracketed expression is now skew-symmetric in k, I and hence 

,~r' \ 

F 1 r r _ f 1 v +- i — \ a 1 

1 rk 1 il I ril il + . I - I I™ 

cx k cx 

is a tensor. A 1 being arbitrary, it follows that 


(36.20) 


r'P, fP 


B)ki = r^n, - r rl r jlc + ■ 1 - 

rx k 


(36.21) 


is a tensor. It is the Riemann-Christoffel curvature tensor. 

Equation (36.19) can now be written 

^A i = (36.22) 

If B‘ jkl is contracted with respect to the indices i and /, the resulting tensor is 
called the Ricci tensor and is denoted by R jk . Thus 

R jk = B) ki (36.23) 

This tensor has an important role to play in Einstein’s theory of gravitation. Since 
B‘ jk , is skew-symmetric with respect to the indices k and /, its contraction with 
respect to i and k yields only the Ricci tensor again in the form - R#. However, 
contraction with respect to the indices i and j yields another second rank tensor, 
viz. 


c _*n« 

— D m — r — - i 

ex cx 


(36.24) 


37. Metrical connection. Raising and lowering indices 

In this section we shall further particularize our space .¥ s by supposing it to be 
Riemannian. That is, we shall suppose that a ‘distance’ or interval d.s between two 
neighbouring points x\ x 1 + dx‘ is defined by the equation 

ds 2 = (y 1J dx , dx J (37.1) 

where the N 2 coefficients g u are specified in some coordinate frame at every point 
of It will be assumed, without loss of generality, that the are symmetric. 
Such a relationship between all pairs of adjacent points is called a metrical 
connection and the expression (37.1) for ds 2 is termed the metric. 

For any two neighbouring points, ds will be regarded as an invariant associated 
with them and the must accordingly transform so that this shall be so. Since 
dx'dx J is an arbitrary symmetric tensor, g ti is symmetric and ds 2 is an invariant, it 
follows by a modified quotient theorem similar to the one proved in section 36 
that g tJ is a tensor. It is called the fundamental covariant tensor. Thecontravariant 
tensor which is conjugate to g (J (see section 32), viz. g‘\ is termed the fundamental 
contravariant tensor. This exists only if g = |g,j| f 0, which we accordingly 
assume to be the case. 
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In the case when .# s is Euclidean, rectangular Cartesian coordinates y' can be 
defined and, in such a frame. g tJ = 6,j. Consider a contravariant vector having 
components A‘ in a general curvilinear x-frame and components B' in the y- 
frame. In the Cartesian frame, covariant and contravariant vectors are indis- 
tinguishable, so that it is natural to define covariant components for the vector by 
the equation 

B, = B' (37.2) 

In the x-frame, let A, be the components of the covariant vector B*. Then 

A, = y,jA‘ (37.3) 

This follows since (i) it is a tensor equation and (ii) it is valid in the y-frame in 
which it takes the form 

B, = = B' (37.4) 

If ,*, v is not Euclidean, equation (37.3) is taken to define the covariant 
components of a vector whose contravariant components are A'. This process of 
converting the contravariant components of a vector into covariant components 
is termed lowering the index. 

If B, is a covariant vector, its contravariant expression is determined by raising 
the index with the aid of the fundamental contravariant vector. Thus 

B‘ = g' i B J (37.5) 

For the notation to be consistent, it is necessary that if an index is first lowered 
and then raised, the original vector should again be obtained. This is seen to be the 
case for, if A, is formed from A' (equation (37.3)), the result of raising its subscript 
is (equation (37.5) ) 

g'’A ] = g'’g lk A k = h' k A k = A' (37.6) 

where equations (32.9) have been used in the reduction. Similarly, if an index is 
first raised and then lowered, the original covariant vector is reproduced. 

Any index of a tensor can now be raised or lowered in the obvious way. Thus, if 
A' l is a tensor, we define 

A) t = g jr A' [ (37.7) 

To allow for the possibility that indices may be raised or lowered during a 
calculation, it will be convenient to displace the subscripts to the right of the 
superscripts. It is also often helpful to keep a record of these operations by placing 
a dot in the gap resulting from the raising or lowering of an index. These 
conventions are illustrated in equation (37.7). 

Suppose an index of the fundamental tensor g tj is raised. The result is 

(fit = = 6) (37.8) 

i.e. the mixed fundamental tensor. The same tensor results when an index of g' 1 is 
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lowered. If both subscripts of are raised, the result is 

flVsy = .</"$ = y rs (37.9) 

Our notation is entirely consistent, therefore, and g i} , g‘ J , are taken to be the 
covariant, contravariant and mixed components respectively of a single funda- 
mental tensor. 

Consider the inner product of two vectors A'. B,. We have 

A'Bj = g^AjBj 
= AjfTB, 

= AjB j 

= A,B‘ (37.10) 

It is clear that the dummy index occurring in the expression for an inner product 
can be raised in one factor and lowered in the other without affecting the result. 
This is obviously valid for the inner product of any pair of tensors. 


38. Scalar prodncts. Magnitudes of vectors 

In Jf.v the magnitude of the displacement vector dx' is taken to be ds as given by 
equation (37.1). If A' is any other contravariant vector, it may be represented as a 
displacement vector and then its magnitude is the invariant A, where 

A 2 =g ij A i A> (38.1) 

This equation is accordingly taken to define the magnitude of A'. 

Raising and lowering the dummy indices in equation (38.1), we obtain the 
equivalent result 

A 2 = g ,J AiAj (38.2) 

It is natural to assume that the associated vectors A t , A' have equal magnitudes 
and hence A is also taken to be the magnitude of A,. Equation (38.2) indicates how 
this can be calculated directly from A ( . 

Since g^A 1 = ,4, and y‘ J Aj = A ', equations (38.1), (38.2)arealso both seen to be 
equivalent to the equation 

A 2 = A,A‘ (38.3) 

The scalar product of two vectors A, B is defined to be the invariant 

A B = AjB' = A'Bj = g, J A‘B i = g‘ i A i B J (38.4) 

It will be noted that A 2 = A- A (38.5) 

By analogy with ,, we now define the angle 0 between two vectors A, B to be 
such that 


ABcosO = A ■ B 


(38.6) 



108 


i.e. 


n A ' B ‘ 

cosO = — - - — - -r 


J[(A’A,)(B k B k )] 
If 0 = jti, the vectors are said to be orthogonal and 

A'B, = 0 


(38.7) 


(38.8) 


39. Geodesic frame. Christoffel symbols 


It is always possible to choose a coordinate frame in which the components of the 
metric tensor are stationary at an assigned point, i.e. cg ij /'cx k all vanish at the 
point. Such a frame is said to be geodesic at the point. 

For, if x‘ = a' are the coordinates of a point A in an x-frame, suppose we 
transform to a new frame by the quadratic transformation 

x* = a' + x‘ + ia i jk x J x k (39. 1 ) 

where x' = 0 at A and the constant coefficients a‘ jk are symmetric in the indices j, k 
(no loss of generality). Differentiation yields the equations 


fjl! 

<?x J ' 


S‘j + a‘j k x i . 


fix J cx k 


(39.2) 


In particular, at the point A 

ox' fi 2 x‘ 

e$ i = di ' fxl* = a ‘ jk 

The transformation equation for the metric tensor is 

_fix r fix s 
9>i ~ r?5c ,: fi i 7 9 rs 


(39.3) 


(39.4) 


Differentiating with respect to x\ we find that 

cTjij fi 2 x r dx s fix' fi 2 x s r;x r fix' dg rs ex' 
fix k fix k fix' fix 1 9rs + ex' cx t fix 1 9rs+ fix' fix 1 fix' fix k 

Substituting from equations (39.3), it follows that at the point A, 


(39.5) 


og„ , . , eg.j 

■%_ ^ikdrj ^jkQis -s ‘fc 


(39.6) 


Suppose, if possible, the coefficients a‘ jk are chosen so that fig ij lcx k = 0. Then, 
writing a\ k g rj = a, tj , it is necessary that 

<*,kj + a Jki = -cg ij /fix k (39.7) 

Cyclically permuting the indices i,j, k , two further equations are obtained, viz. 

Oji t + a kiJ = -figjjfix' (39.8) 

a k ji + a iJk = -figjfix' (39.9) 
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Since a lJk is symmetric in the indices i, /, by adding the last pair of equations and 
subtracting equation (39.7), we get 


«. jk = - i| 


( l g JH , ‘1^1 
ex' dx J 



-[*.*] 


(39.10) 


[ij, A:] is called the Christoflel symbol of the first kind. It is now easily verified that 
the condition (39.7) is satisfied provided a ijk is defined by equation (39.10) and, 
hence, that dgji}x k = 0 at A. 

[(/, A] is not a tensor, but its indices invariably behave like subscripts in any 
formula in which it occurs. It is symmetric in the indices i,j. 

We now deduce that 


ah = 9 k 'a ijt = - g k, [ij , r] = - (39.1 1) 

where {, k J } is Christofjel's symbol of the second kind. It is clearly obtained from the 
symbol of the first kind [ij, k] by raising the final index k. It, also, is not a tensor, 
but ij always behave as subscripts and k behaves as a superscript; it is symmetric 
in i and j. 

Suppose we next transform from the x-frame to a y-frame by a linear 
transformation 

x' = b)^ (39.12) 

where the frj are constants. Then dx‘ = b ) dy J and it is a well known result from 
algebra that the quadratic form g {J dx'dx J can be reduced to the diagonal form 

(dy 1 ) 2 + (dy 2 ) 2 + . . . + (dy v ) 2 (39.13) 

at the point A by proper choice of the b j (some of these coefficients may have to be 
given imaginary values). Let hij be the metric tensor in the y-frame. Then 


cx r (}x 5 

K = — — Sr, = b'b)g„. 
vy ' dy 1 


(39.14) 


and, in particular, /i (j = S tj at A. Differentiating the last equation, we obtain 


dh 




, . = b'b)b l k 
Oy~ ox 


(39.15) 


showing that dhij/dy k vanishes at A. Thus, in the y-frame, h t j is stationary with 
value djj at A. The implication is that, in a small neighbourhood of A, the 
coordinates y‘ will behave like rectangular Cartesian coordinates. The y-frame is 
the closest approximation to a rectangular Cartesian frame that can be fitted to 
the S# N in the neighbourhood of A. 

If the y-frame were exactly rectangular Cartesian (i.e. the space were 
Euclidean), in order that the law for parallel displacement of a vector, equation 
(33.7), should agree with the one usually adopted, it would be necessary to take all 
the components of the affinity to be zero in this frame. It is natural, therefore, to 
define the affinity at the point A of &l N so that, in the y-frame, its components are 
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zero. With this choice of affinity, in the y-frame covariant derivatives will reduce 
to ordinary partial derivatives at the point A; in particular, 

* M :i = ?y = 0 < 39 -l6) 

at A. But this equation is a tensor equation and, being valid in one frame, is 
accordingly valid in all frames. Thus, with this choice of affinity 

%. k = 0 (39.17) 

Assuming that the affinity is defined at all points of in this manner, the last 
equation will be valid throughout the space and in all frames. Since this affinity is 
clearly symmetric in the y-frame, it will be symmetric in all frames. 

Writing equation (39.17) out at length, we have 

- r> jr = o (39.18) 

Cyclically permuting the indices i, j, k to obtain two further equations, it now 
follows as from equations (39.7)— (39.9) that 

[•/.*]=«* rH, (39.19) 

Raising the index k, this gives 

= (39.20) 

The affinity determined by this equation in any frame will be called the metric 
affinity and will invariably be assumed in all later developments. 

Since equation (39.17) is valid using the metric affinity, the metric- tensor 
behaves like a constant with respect to covariant differentiation. Further, since 

g‘ i gtj = 6 , t (39.21) 

by taking the covariant derivative of both members of this equation, we obtain 

= 0 (39.22) 

Multiplying by g kr and summing with respect to k. we then find that 

= 0 (39.23) 

Thus, all forms of the metric tensor behave like constants under covariant 

differentiation. 

It is now clear that 

(»i i ^ , ).i = M i t (39.24) 

This shows that the lowering of an index followed by a covariant differentiation 
yields the same result as when these two processes are reversed. Similarly, it can be 
shown that the raising of an index and covariant differentiation are two processes 
which commute. 
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40. Bianchi identity 

If we choose a frame which is geodesic at a point x', c g tJ /<'x k will vanish at the 
point and the two ChristofFel symbols will therefore also vanish at the point. 
Thus, all the components of the affinity will vanish at the point and covariant 
derivatives will reduce to partial derivatives there. It then follows from equation 
(39.23) that the partial derivatives dg ,] l?x k all vanish at the point, also. 

In such a geodesic frame, therefore, 


Dl _ * 

CX" 




= JlIj! _ ?T/». 

cx m i lx* <1x m cx l 


(40.1) 


since the (but not their derivatives necessarily) all vanish at this point. 
Cyclically permuting the indices k, /, m in equation (40.1), we obtain 
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Jlm k ~ Cx k cx‘ 

rt 2 V‘ 

C 1 j, 

cx l dx m 

(40.2) 

p2 ri 

Dl _ f ‘ Jf 

D imk . 1 -> l ' m 

(x'cx 

c 2 T' 

1 jm 

1 k 

cx cx* 

(40.3) 


Addition of equations (40.1), (40.2), (40.3), yields the following identity 

+ = 0 (40.4) 


But this is a tensor equation and, having been proved true in the geodesic frame, 
must be true in all frames. Also, since the chosen point can beany point of t)f s , it is 
valid at all points of the space. It is the Bianchi identity. 


41. The covariant curvature tensor 

The components of B' ju are not all independent since the tensor is skew- 
symmetric in the indices k, /. In addition, however, ) the affinity is symmetric, it is 
easily verified from equation (36.21) that 

B'ju+ B‘ k ij + B' lJk = 0 (41.1) 

If the affinity is metrical, by lowering the contravariant index of the 
Riemann-Christoffel tensor, a completely covariant curvature tensor B iju is 
derived. This has a number of symmetry properties, one of which is obtained from 
our last equation immediately by lowering the index i throughout to give 

Bijki + Bikij ,+ B<t jk = 0 (41.2) 

Further such properties can be established by first calculating an expression for 
the tensor in a geodesic frame. Thus, in such a frame 
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B - a * - a (‘Ji-'l'A 

“•!*< SirBjU ^xrl ~^ k . J 

= ffir^S ■ S J ) - 0,7 ( ,“, (</"U*, S]) 

= -^1 [>*•*]) < 41 ' 3 ) 


since f-<f n /cx k = 0 in a geodesic frame. Using the result g„g rs = 6 ■ and substitut- 
ing for the ChristofTel symbols, we now find from equation (41.3) that 


1 / c 2 g u f: 2 y Jk c 2 g lk ? 2 y fi \ 

ljU 2\cx J cx k cx'cx 1 cx^x 1 Cx‘cx k ) 

(41.4) 

The following equations are now easily verified: 


Biju — ~ Bjiki 

(41.5) 

B,jki — —B,jik 

(41.6) 

Biju = B kuj 

(41.7) 


Being true in the geodesic frame, these tensor equations must be valid in all 
frames. Note that the tensor is skew-symmetric with respect to its first pair of 
indices and its last pair. 

Also lowering the superscript / throughout the Bianchi identity (40.4), we 
obtain 


B,,kt.m+ I + B„„k.l - 0 


(41.8) 


42. Divergence. The Lapiacian. Einstein’s tensor 

If the covariant derivative of a tensor field is found and then contracted with 
respect to the index of differentiation and any superscript, the result is called a 
divergence of the tensor. With respect to orthogonal coordinate transformations 
in<? v , the partial and covariant derivatives are identical and then this definition of 
divergence agrees with that given in section 12. 

From the tensor A \ two divergences can be formed, viz. 

div,>T\ = A‘\ , and div,/f' t = ,4'V, (42.1) 

A contravariant vector possesses one divergence only, which is an invariant. If the 
affinity is the metrical one, such a divergence is simply expressed in terms of 
ordinary partial derivatives thus: since a derivative of a determinant can be found 
by differentiating each row separately and summing the results, we deduce that 


cg_ 

tx j 


^ tk ^Qik ik 

’ — - = <70 

(X J 


Cx 1 


(42.2) 
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Since 


equation (42.2) reduces to 


= »»“([(/. *] + [*/.'']) = 2^1 i 01 




dx 


Hence 


1 d 


Jgdx 

Now let A' be a vector field. Its divergence is 


7< V») 


5/4' . .. 


i r , sA i a , , 


(42.3) 

(42.4) 

(42.5) 


(42.6) 


which is the expression required 

In particular, if the vector field is obtained from an invariant F by taking its 
gradient, we have 

<W 

A, = 


and hence 


dx‘ 

dV 

A ‘ = g ,J — 

dx J 


(42.7) 

(42.8) 


From equation (42.6), it now follows that the divergence of this vector is 


div grad V - V 2 V - — — — 
y/gdx' 


.(*•*£) 


(42.9) 


The right-hand member of this equation represents the form taken by the 
Laplacian of Fin a general Riemannian space. In S N , employing rectangular axes, 
g‘> = S ,J , g = 1 and thus 



V 2 F = 


d 2 V 
dx'dx' 


(42.10) 


which is its familiar form. 

We shall now calculate the divergence of the Ricci tensor R jk (equation (36.23) ). 
If the metric affinity is being employed, this tensor is symmetric, for 

R kl — B'kji = y' r Brkji = S" Bj irk = d'B, jk, 

= B' jkr = R lk 


(42.11) 
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having employed equations (4 1 .5) - (4 1 .7). Raising either index accordingly yields 
the same mixed tensor R {. If this is contracted, an invariant 

R = Rj (42.12) 

is obtained. R is called the c urvature scalar of s . 

The skew-symmetry of B ljk i with respect to its first two and its last two indices 
means that equation (41.8) can be writen 


B, ikl.n 

i ® tjtnl.k ^ 

(42.13) 

Multiplying through by g‘‘g jk , we then get 


g jk B', LKm 

-g jk B‘ imlk -g“B k „ k , = Q 

(42.14) 

and this is equivalent to 




-g ik R im .. k -g"R m , = 0 

(42.15) 

or R m 

-2R‘. t = 0 

(42.16) 

Thus 

= i cR’cx m 

(42.17) 

is the divergence of the Ricci tensor. 

Consider now the mixed tensor 




(42.18) 

Us divergence is 

r 

' cx 



Di i £R 



= 0 

(42.19) 


This is Einstein's Tensor. Its covariant and contravariant components are 

R.j-iy^R, R'‘-kg ,J R (42.20) 

respectively. 


43. Geodesics 

Let C be any curve constructed in a space 31 s having metric (37.1) and let s be a 
parameter defined on C such that, if s, s + ds are its values at the respective 
neighbouring points P, P' on C, then ds is the interval between these two points. If 
x' arc the coordinates of any point P on C, then the curve will be defined by 
parametric equations 

x* — x'(s) (43.1) 

Since dx‘ are the components of a vector and ds is an invariant, dx'/ds is a 
contravariant vector at P. Its magnitude is, by equation (38.1), 



(43.2) 
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and this is unity by equation (37.1). dx‘/dx is termed the unit tangent to the curve 
at P, its direction being that of the displacement dx 1 along the curve from P. 

Suppose C possesses the property that the tangents at all its points arc parallel, 
i.e. the curve's direction is constant over its whole length. This property is clearly 
quite independent of the coordinate frame being employed. In <$ 3 , such a curve 
would, of course, be a straight line. In ,# s , the curve will be called a geodesic. A 
geodesic is accordingly the counterpart of the Euclidean straight line in a 
Riemannian space. Suppose P, P’ are neighbouring points on a geodesic having 
coordinates x',x‘ + dx‘ respectively. If the unit tangent at P is parallel displaced to 
P", it will then be identical with the actual unit tangent at this point. Now, by 
equation (35.4), after parallel displacement from P to P\ the unit tangent has 
components 


dx‘ 

ds 



dx 1 

d.s 


dx-' 

d.s 


dx 1 


(43.3) 


But the actual unit tangent for the point P' has components 



■ 


dx' 

ds 


d.s 


The vectors (43.3) and (43.4) are identical provided 


d 2 x' dx'dx‘ 
d.s 2 + >l d.s d.s 


= 0 


(43.4) 


(43.5) 


If these equations arc satisfied at every point of the curve (43.1 ). it is a geodesic. 

The N equations (43.5) arc second-order differential equations for the 
functions x'(.s) and their solution will involve 2 N arbitrary constants. If A, B are 
two given points having coordinates x' = a‘, x* = b' respectively, the 2 N 
conditions that the geodesic must contain these points will, in general, determine 
the arbitrary constants. Hence there is, in general, a unique geodesic connecting 
every pair of points. However, in some cases, this will not be so. For example, the 
geodesics on the surface of a sphere ( ) are great circles and, in general, there are 
two great circle arcs joining two given points, a major arc and a minor arc. Also, if 
these points are dtamterically opposed to one another, there is an infinity of great- 
circle arcs connecting them. 

Since dx'/d.s is everywhere a unit vector, on a geodesic 


dx‘ dx' 
; ds ds 


(43.6) 


This must, accordingly, be a first integral of the equations (43.5). To show that this 
is the case, multiply equations (43.5) through by 2^ lr dx r /ds and sum with respect 
to i to obtain 


(43.7) 
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Now 

Also 


dx'd 2 .x' 

2y ‘ r d7 d?” 


ds 


dx' dx' 

r d7~d7 


d g ir dx 1 dx r 
d.s d.s d.s 


(43.8) 


dx'dx l dx' _,dx J dx‘dx r 

V ' Jk ds d.s d.s = 2| ik ' ^ ds ds ds 

r , , r i i dx J dx‘ dx r 

= ([yfc,r] + [ rkj]) — —- 


cg jr dx 1, dx J dx r 
cx k ds ds ds 
dg jr dx J dx r 
ds ds ds 


(43.9) 


By addition of equations (43.8) and (43.9), it will be seen that equation (43.7) can 
be expressed in the form 


d/ dx'dx'\ 
ds\'^' J ds d.s ) 


= 0 


(43.10) 


Upon integration, there results the first integral 

dx‘ dx J 

q.. — = constant 

ds ds 


(43.11) 


The constant of integration must, of course, be taken to be unity. 

The definition of a geodesic which has been given at the beginning of this 
section cannot be applied to the class of curves for which the interval ds between 
adjacent points vanishes. For such a curve, the parametric representation (43.1) is 
not appropriate and a unit tangent cannot be defined. Instead, suppose that a 
( 1 -1) correspondence is set up between the points of the curve and the values of an 
invariant /. in some interval /. 0 $/.$/.,, so that parametric equations for the 
curve can be written 

x' = x‘(/.) (43.12) 


It will be assumed that the derivatives dx'/d/. all exist at each point of the curve. 
These derivatives constitute a contravariant vector and this has zero magnitude 
for, since d.s = 0 along the curve. 




dx' dx' 
d/. dz. 


(43.13) 


This vector will be in the direction of the displacement vector along the curve dx‘ 
and will be called a zero tangent to the curve. The curve will be termed a null 
geodesic if the zero tangents at all points of the curve are parallel. This implies 
that, when the zero tangent at P is parallel displaced to the adjacent point P\ it 
must be parallel to the zero tangent at this latter point, and since the magnitudes 
of these two vectors at P' are the same, they will be taken to be identical. The 
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condition for this to be so is found, as before, to be 


dx 1 +r ' ik dl dA ° 


(43.14) 


These are, therefore, the equations of the null geodesics. It may now be shown, by 
an argument similar to that culminating in equation (43.1 1), that a first integral of 
these equations is 

dx‘ dx J 

tt- = constant (43.15) 

d/. d/. 


In this case the constant must be zero. 

Equation (43.5) may be put in an alternative form which is more convenient for 
particular calculations, as follows; Multiply through by 2g r( and sum with respect 
to i; the resulting equation is equivalent to 


Now 


d / dx'\ _dg ri dx‘ dx J dx* 

dl ( 2dri d7 ) " 2 "d7 dT + 9ri r ' ik ~d7 ~di = 0 


(43.16) 


and 


2 dg„ dx' _ ^£g ri dx* dx 1 
ds ds ?x k ds ds 

_ / <^9rj CJJrt \ dx' d lx* 

\ dx k cx j ) ds ds 


(43.17) 


2gr,r' t = [jk, r] 


<~9r, 09rk _ rffjt 
ex* cx J dx' 


Equation (43. 16) accordingly reduces to 


jd 

ds 



r'^dx* dx* 
ex' ds ds 


Equation (43.14) for a null geodesic may be expressed similarly. 


(43.18) 


(43.19) 


Exercises 5 

1. Aij is a covariant tensor. If B {j = A Jt , prove that B tj is a covariant tensor. 
Deduce that, if A u is symmetric (or skew-symmetric) in one frame, it is symmetric 
(or skew-symmetric) in all. (Hint: The equations A tj = Aj,,Ajj = - A j,are tensor 
equations.) 

2. (x, y, z) are rectangular Cartesian coordinates of a point P in <? 3 and (r, 6, <p ) 
are the corresponding spherical polars related to the Cartesians by equations 
(30.3). A is a contravariant vector defined at P having components (A x , A r , A 1 ) in 
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the Cartesian frame and components (A\ A 6 , A <t ) in the spherical polar frame. 
Express the polar components in terms of the Cartesian components. Ol, 02, 03 
are rectangular Cartesian axes such that P lies on 0 1 and 03 lies in the plane Oxy. 
If (A 1 , A 2 , A 3 ) are the components of A in this Cartesian frame, show that 

A 1 = A r , A 1 = rA e , A 3 = r sinO A * 

(Note: Assume the Cartesian axes are right-handed.) 

3. If Ai is a covariant vector, verify that B,, = A k j A ] t transforms like a 
covariant tensor. (This is curl A.) If A is the gradient of a scalar, verify that its curl 
vanishes. 

4. If A tj is a skew-symmetric covariant tensor, verify that 

B u k = A a.k + A jk , + A tlj 

transforms as a tensor. 

5. Assuming the transformation inverse to (31.1) exists, prove that each 

determinant |rx'/t'x-'|, is the reciprocal of the other. A' t is a mixed 

tensor with respect to this transformation. Show that the determinant | A‘j\ is an 
invariant. 

6. If I~E is an affinity, show that the torsion defined by 

JViiiVrfc) 

is a tensor. g tj is a symmetric tensor. Write down an expression for its covariant 
derivative g l)k . By considering this equation and two similar equations obtained 
by cyclic permutation of the indices ij, k, show that if the covariant derivative of 
g :j is to vanish identically, the affinity must be given by 

r’jt = { i \ i + T‘ jk + g ,r {T),g ik + T kr g sj ) 

1. Show that 

A K ,~ A ,., = A i-i~ A J ‘ 

provided the affinity is symmetric. 

8. Show that 

A r.ik — A ,.k, = B'ijkA r + (T^— r r jk )A ir 

and deduce that B' jk is a tensor and that covariant differentiations are 
commutative in a space for which B' jk = 0 and the affinity is symmetric. Obtain 
the corresponding result for a contravariant vector A 1 . 

9. Ai is defined at the point x 1 and is parallel displaced around a small contour 
enclosing the point. Prove that the increment in A t resulting from one circuit is 
given by 

A A,= —jBjj k A,x Jk 

where ot jk is defined by equation (36.10). 

10. The parameiric equations of a curve in are 

x' = x * ( t ) 
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t is an invariant parameter. A tensor A) is defined over a region containing the 
curve. P, P' are neighbouring points t, t + At on the curve and A/I ‘ is defined to be 
the difference between the actual value of the tensor at P' and the value of the 
tensor at P after it has been parallel displaced to P'. Prove that 


^=lim^ 

D( 4 ,^o At 



(D/I j/Dt is called the intrinsic derivative of the tensor along the curve.) 

11. Verify that { /*} transforms as an affinity. 

12. If Ajj is symmetric, prove that A {j k is symmetric in i and j. 

13. Show that the number of the components of B) kl which may be assigned 
values arbitrarily is, in general, i N 3 ( N — 1 ). If the affinity is symmetric, show that 
this number is %N 2 (N 2 - 1). (Hint: Use equation (41.1).) 

14. Show that the number of the components of B ijk , which may be assigned 
values arbitrarily is N 2 (N 2 — 1 )/ 1 2. (Hint: Use equations (41.2), (41.5), (41.6), 
(41.7).) 

15. By differentiating the equation 


with respect to x\ show that 


9**9 * = 


cg im 

cx‘ 


= -g mk g“ 


<~9jk 

vx‘ 


and hence that 

ox 

Deduce that g ij k = 0. 

16. If the affinity is the metric one, prove that 

Kjk = B Jki = — { j t } + p x jj>, x k \J 9 

Ski — B‘ti = 0 

(Hint: Employ equation (42.5).) Deduce that R jk is symmetric. 

17. If 9 , (p are co-latitude and longitude respectively on the surface of a sphere 
of unit radius, obtain the metric 

ds 2 = d0 2 + sin 2 0d$ 2 


for the surface. Show that the only non-vanishing three index symbols for this 
are 


{ 2 l 2 } = -sin 6 cos 0, { , 2 2 } = { 2 2 ,} = cotO 




Show also that the only non-vanishing components of B ijkl are 
B 1212 ~B i 22 i = S 2121 = —B 2 h 2 = sin 2 0 
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and that the components of the Ricci tensor are given by 

r 12 = R 21 =0, R,, = -l, R 22 =-sin 2 * 0. 

Prove that the curvature scalar is given by R = — 2. 

18. Employing equation (42.9), obtain expressions for V 2 V in cylindrical and 
spherical polars. 

19. In a certain coordinate system 




where <j > , ip are functions of position. Prove that B) u is a function of i (j only. If 
i p = -log(u,x‘) prove that 

R ik = B) ki = 0 

20. In the ^ 2 whose metric is 

, dr 2 + r 2 dfl 2 r 2 dr 2 

ds 2 — 1 71 ui ( r>a ) 

r 2 -a 1 (r 2 -a 2 ) 2 

prove that the differential equation of the geodesics may be written 

“’(as) 

where k 2 is a constant such that k 2 = 1 if, and only if, the geodesic is null. By 
putting r d0/dr = tan (p , show that if the space is mapped on a Euclidean plane in 
which r, 0 are taken as polar coordinates, the geodesics are mapped as straight 
lines, the null geodesics being tangents to the circle r = a. 

21. A 2-space has metric 

ds 2 = 9 ,i(^‘) ! +022<d * 2 ) 2 

where g u , g 2 i are functions of x 1 and x 2 . B iju is its covariant curvature tensor 
and Rjj is its Ricci tensor. Prove that 

R 12 = 0, R[ 1 922 = ^2201 l = #122 I 


If R = g‘ J R, ; 2 , show that R = 2B 1221 /(g u <? 22 ). Deduce that R i} = {Rg^. 

22. Prove that 

(i) ' 4 ° :i = 7 

(ii) X lj ,„ = 0 

provided X‘ J is skew-symmetric. Hence prove that, for any tensor A iJ 

A‘j — Aij 
H ■<! ~ H ■!> 



23. A curve C has parametric equations 

x' = x'(t) 
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and joins two points A and B. The length of the curve is defined to be 


8 B 



A A 


Write down the Euler conditions that L should be stationary with respect to all 
small variations from C and by changing the independent variable in these 
conditions from t to s, show that they are identical with equations (43.19). (This 
provides an alternative definition for a geodesic.) 

24. If Tj k is a symmetric affinity, show that 

r‘j k + 8‘jA k + d' k Aj 

is also a symmetric affinity. 

If Bj kl , B'jh are the Riemann-Christoffel curvature tensors relative to the 
affinities Tj k , r'J respectively, prove that 

B'jkl = B) kl +8' k Aji — <3)/4y k + £y(>l k , — A, k ) 
where A tJ = A,Aj - A, r 

Hence show that if A, is the gradient of a scalar, then 


Bju — B'i‘ — B’jn — B‘i,j 

25. Prove that the affinity transformations form a group. 

26. Prove that 


c , .. 8(b 

— k (V0) 2 = 2 g'‘^ , k 
cx cx 

27. Two metrics are defined in J* v , viz. 

ds 2 = J/iydx'd.x-', ds 2 = e°g,j dx : d.x J 

where a is a function of the x‘. If rj k , are metric affinities constructed from 
these metrics, prove that 


r'y k — r'j k + a j k 


where 


= \{8‘a k + d[aj-y jk g ir a, r ) 


Curvature tensors B‘ jkh B) u are constructed from these affinities. Prove that 
Bju = B‘j k i + Aj, k — A‘j k , + A‘ rk A r ji — A‘ rl A' k 
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Deduce that 

Rjk = R ,k + A t - /4J t , + A‘ rk A'j - A J r A' jk 

and show also that A' tj = {No j. 

28. Oblique Cartesian axes are taken in a plane. Show that the contravariant 
components of a vector A can be obtained by projecting a certain displacement 
vector on to the axes by parallels to the axes and the covariant components by 
projecting by perpendiculars to the axes. 

29. Define coordinates (r, 0)ona right circular cone having semi-vertical angle 
a so that the metric for the surface is 

d.s 2 = dr 2 + r 2 sin 2 a d<j> 2 . 

Show that the family of geodesics is given by 

r - a see (0 sin y — ft) 

where a, (i are arbitrary constants. Explain this result by developing the cone into 
a plane. 

30. An M v has metric 

d.s 2 = e* dx‘ d.\‘ 

where /. is a function of the x\ Show that the only non-vanishing ChristofTel 
symbols of the second kind are 



where = ck/vx'. Deduce that 

and that the scalar curvature of this space is given by 

R = ( /V — 1 )e ' ■*[/.„ + j( ,V — 2)x r /. r ] 
where = <~ 2 X/cx r ix r . 

31. 0 is the co-latitude and <p is the longitude on a unit sphere, so that the metric 
for the surface is 

ds 2 = d0 2 + sin 2 Odip 2 

The co variant vector <4, is taken with initial components (A, Y) and is carried, by 
parallel displacement, along an arc of length <f> sin x of the circle 0 = a. Show that 
the components of A, attain the final values 

A i = A cos ( 0 cos a ) + Y cosec a sm ( <p cos a ) 

A 2 = — A sin a sin (<j> cos a) + Y cos (4> cos a) 
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Verify that the magnitude of the vector A, is unaltered by the displacement. 
32, An Mi has metric 

ds 2 = /.dr 2 + r 2 (d0 2 + sin 2 9d(p 2 ) 

where /. is a function of r alone. Show that, along the geodesic for which 9 = j?t. 
dO.ds = 0 at s = 0, 



where r = b sect//. Interpret this result geometrically when /. = 1. 

33. y 1 (i = 1, 2, 3, 4) are rectangular Cartesian coordinates in <? 4 . Show that 

y 1 = R cos 0 
y 2 = R sin 6 cos <p 
y 3 = R sinO sirup cost p 
y 4 = R sin 0 sin ip sin t p 

are parametric equations of a hypersphere of radius R. If (9, 0, 0) are taken as 
coordinates on the hypersphere, show that the metric for this Mi is 

ds 2 = R 2 [d0 2 + sin 2 O(d0 2 +sin 2 0d0 2 )] 

Deduce that in this M t , 

B t 2 1 2 = R 2 sin 2 0, B 2323 = R 2 sin 4 9 sin 2 (p . 

3 t 3 i = R 2 sin 2 Osin 2 (p , 

all other distinct components being zero. Hence show that 

B,jtl — K ~ (/itdjk) 

where K = l/R 2 . (This is the condition for the space to be of constant 
Riemannian curvature K.) 

34. An Mi has metric 

ds 2 = sech 2 y(dx 2 + dy 2 ) 

Find the equation of the family of geodesics. 

0, <p are co- latitude and longitude respectively on the surface of a sphere of unit 
radius. Mercator's projection is obtained by plotting x, y as rectangular Cartesian 
coordinates in a plane, taking 

x = <p, y = log cot \0 

Calculate the metric for the spherical surface in terms of x and y and deduce that 
the great circles are represented by the curves 

sinh y = x sin (x + /?), 
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where a, fi are parameters, in Mercator’s projection. 

35. Obtain the formula 


Bjjki — 


1 


o z g,i 

2 V cx J Px k 


c 2 g jl 

cx'cx 1 ('x‘rx‘ Px'rx k J + ^ 


<' 2 9jk 


i: 2 g>k 


s 

J k 


■f/si 


r 

i k 


s 

j I 


36. An M 2 has metric ds 2 = 2<pdxdy, where 4>~<p(x,y). Calculate the 
component B, 2 1 2 of the curvature tensor and state the values of the remaining 1 5 
components. Deduce that the space is flat provided 

d 2 <p P(j>c^> 

<t> ^ T " T 
vxcy cx cy 

Putting (j> = e*, obtain the general form for cj> satisfying this condition. Deduce 
that coordinates <*, rj can be found such that the metric takes the form 
ds 2 = 2d£ dry 

37. If Aj is such that A r/ + A t , = 0, by cyclically permuting the indices ij, k in 
A,, k -A,; kt = A,B' ijk to give two further equations, prove that A, A - - A r B r kij . 

38. 4>(u, c)and i j/[u, u)are the real and imaginary parts of an analytic function 
/( w) of the complex variable w = u + iv. Show that the equations x = 0(u, v), 
) =(/( (u, t) transform the Pythagorean metric ds 2 = dx 2 + dy 2 into the metric 


ds 2 = 


/ c' 0 \ 2 /<?< 
\ dll ) \ r 


(du 2 + dr 2 ). 


By taking/(w) = I /w, explain how it is possible to write down the equation of the 
family of geodesics in a space whose metric is 

, du 2 + dr 2 

ds = — s s t 

(« 2 +r 2 ) 2 

Also obtain this equation by transforming the metric using the equations 
u = rcosO , r = rsinO, and writing down the differential equations for the 
geodesics in terms of r and 0. 

39. (x, y)are rectangular Cartesian coordinates and (r, G) are polar coordinates 
in a Euclidean plane. A , y is a symmetric tensor field defined in the plane by its 
components A xx = A yy = 0, A xy = A yx = x/y + y/x. Calculate the contravariant 
polar components of the field in terms of rand 0,and deduce that A" + r 2 A m = 0. 
(Ans. A" = 2, A ,e = 2cot2 6/r, A m = -2/r 2 .) 

40. x, y, 2 are rectangular Cartesian coordinates in Parametric equations 
for a hyperbolic paraboloid are taken in the form x = u + v, y = u — v, z = uv. A 
covariant tensor field on the surface is defined by the equations A uu = u 2 , A uv 
= A vu = -uv, A vv = v 2 . Show that the contravariant components are one- 
quarter the covariant components. 
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41. x, y are rectangular Cartesian coordinates in a Euclidean plane and u, Dare 
curvilinear coordinates defined by x = a cosh u cos d, y = a sinh u sin v. A covari- 
ant vector has components A x , A y at the point (x, y) and curvilinear components 
A u , A v . Show that 

2 

A x = - (/l u sinhu cos v - A c cosh u sin t)/(cosh 2u - cos 2v) 
a 

42. x, y are rectangular Cartesian coordiantes in a plane. Curvilinear 
coordinates u, v are defined by the transformation equations u = ^(x 2 — y 2 ), 
v = xy. Sketch the families of coordinate lines u = const., v = const, and show 
that the metric in the uoframe is 


ds 2 = j(u 2 + u 2 ) l ' 2 (du 2 +dr 2 ) 


A covariant vector has Cartesian components (A x ,A y ) and curvilinear com- 
ponents (A u , A x ). Show that 

A u = (xA x -yA,)/(x 2 +y 2 ) 


and derive the corresponding formula for A„. 

43. (x, > ) are rectangular Cartesian coordinates in a Euclidean plane and (u, v) 
are curvilinear coordinates defined by the equations x = i(u 2 +v 2 ), y = uv. A 
covariant vector field A { is defined over the plane in the uy-frame by the equations 

= A v = (u 2 -v 2 ) 2 . Show that its divergence is equal to 2 (m - v) 2 /(u + v). 
Calculate the contravariant vector at the point u = 0, v = 1 and use the law of 
parallel displacement along the curve u - 0 to calculate the parallel displaced 
vector at the point u = 0, v = 2. (Ans. A" = A v = j.) 

44. An St i has metric ds 2 = dx 2 + x 2 d>' 2 . Calculate the components of its 
metric affinity. Deduce that the divergence of the vector field whose covariant 
components are given by A x = x cos 2 y, A, = —x 2 sin 2>\ vanishes. 

45. If the x-frame is geodesic at the point x\ prove that 


c l x‘c , x J s t 


1.9 P (9sp.ijt 9 tp.ijs Qst.ijp ) 


ft, _»r ; 5 ! [ 

11 Lcx'cix ' 1 [j s J cx‘c“x s }/ 

s }-^ 
Lcx-'cx [l sj OX J C 


s 

x s | i k 

l_ V J 1 

and deduce that R j , = j. 

46. An St 2 has metric ds 2 = y 2 dx 2 + dy 2 . Deduce the parallel transfer 
equations 

<M X = - A x dy - yA y dx, SA y = -A x dx 

y y 
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Using these equations, parallel transfer the vector along the curve y = sec x from 
the point x = 0, y = 1 at which its components are A x = 0, A y = 1, to the point x 
= ti/3, y = 2. 

47. If 6, <p are latitude and longitude respectively on the surface of a globe of 
unit radius, show that the geodesics on the globe have equations tan0 
= tan a sin (if) + /?), where a, are constants. 

48. A space has metric ds 2 = sech 2 y(dx 2 4 -dy 2 ). A vector A f is parallel 
displaced from the point x = 0, y = b to the point x = a, y = b along the line 
y = b. Its initial components are (3f, y). Show that its final components are given 
by 

A t = X cos (a tanhb) + Tsin (a tanh 6) 

A 2 = - -Y sin (a tanh b) + Y cos ( a tanh b) 

49. Replacing A, in the argument of section 34 by A\ obtain the transform- 
ation law for an affinity in the form 

_ <5x'<3x s <3x‘ d 2 x‘ dx' dx s 

Jk dx r dx i vx k sl dx'dx* dx' dx k 


Prove that this is equivalent to equation (34.8). (Hint: Differentiate 


ex' ex' 

Jx’d^j 




50. Using the transformation law for an affinity in the form given in the last 
exercise, if r = 0*r' t show that an x-frame can always be found such that 
f = 0 everywhere. Show that this frame is determined by the equations 


.. d 2 r dx‘ 

q‘ k T — r = — r j 

dx ] dx k vx J 


and that these equations can be written V 2 x' = 0. (The coordinates x' are said to 
be harmonic.) 



CHAPTER 6 


General Theory of Relativity 


44. Principle of equivalence 

The special theory of relativity rejects the Newtonian concept of a privileged 
observer, at rest in absolute space, and for whom physical laws assume their 
simplest form, and assumes instead that these laws will be identical for all 
members of a class of inertial observers in uniform translatory motion relative to 
one another. Thus, alt hough the existence of a single privileged observer is denied, 
the existence of a class of such observers is accepted. This seems to imply that, if 
all matter in the universe were annihilated except for a single experimenter and his 
laboratory, this observer would, nonetheless, be able to distinguish inertial frames 
from non-inertial frames by the special simplicity which the descriptions of 
physical phenomena take with respect to the former. The further implication is, 
therefore, that physical space is not simply a mathematical abstraction which it is 
convenient to employ when considering distance relationships between material 
bodies, but exists in its own right as a separate entity with sufficient internal 
structure to permit the definition of inertial frames. However, all the available 
evidence suggests that physical space cannot be defined except in terms of 
distance measurements between physical bodies. For example, such a space can be 
constructed by setting up a rectangular Cartesian coordinate frame comprising 
three mutually perpendicular rigid rods and then defining the coordinates of the 
point occupied by a material particle by distance measurements from these rods in 
the usual way. Physical space is, then, nothing more than the aggregate of all 
possible coordinate frames. A claim that physical space exists independently of 
distance measurements between material bodies, can only be substantiated if a 
precise statement is given of the manner in which its existence can be detected 
without carrying out such measurements. This has never been done and we shall 
assume, therefore, that the special properties possessed by inertial frames must be 
related in some way to the distribution of matter within the universe and that they 
are not an indication of an inherent structure possessed by physical space when it 
is considered apart from the matter it contains. This line of argument encourages 
us to expect, therefore, that, ultimately, all physical laws will be expressible in 
forms which are quite independent of any coordinate frame by which physical 
space is defined, i.e. that physical laws are identical for all observers. This is the 
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general principle oj relativity. This does not mean that, when account is taken of 
the actual distribution of matter within the universe, certain frames will not prove 
to be more convenient than others. When calculating the field due to a 
distribution of electric charge, it simplifies the calculations enormously if a 
reference frame can be employed relative to which the charge is wholly at rest. 
However, this does not mean that the laws of electromagnetism are expressible 
more simply in this frame, but only that this particular charge distribution is then 
described more simply. Similarly, we shall attribute the simpler forms taken by 
some calculations when carried out in inertial frames, to the special relationship 
these frames bear to the matter present in the universe. Fundamentally, therefore, 
all observers will be regarded as equivalent and, by employing the same physical 
laws, will arrive at identical conclusions concerning the development of any 
physical system. 

The main difficulty which arises when we try to express physical laws so that they 
are valid for all observers is that, if test particles are released and their motions 
studied from a frame which is being accelerated with respect to an inertial frame, 
these motions will not be uniform and this fact appears to set such frames apart 
from inertial frames as a special class for which the ordinary laws of motion do 
not apply. However, by a well-known device of Newtonian mechanics, viz. the 
introduction of inertial forces, accelerated frames can be treated as though they 
were inertial and this suggests a way out of our difficulty. Thus suppose a space 
rocket, moving in vacuo, is being accelerated uniformly by the action of its 
motors. An observer inside the rocket will note that unsupported particles 
experience an acceleration parallel to the axis of the rocket. Knowing that the 
motors are operating, he will attribute this acceleration to the fact that his natural 
reference frame is being accelerated relative to an inertial frame. However he may, 
if he prefers, treat his reference frame as inertial and suppose that all bodies within 
the rocket are being subjected to inertial forces acting parallel to the rocket’s axis. 
If a is the acceleration of the rocket, the appropriate inertial force to be applied to 
a particle of mass m is - ma. Similarly, if the rocket’s motors are shut down but the 
rocket is spinning about its axis, an observer within the rocket will again note that 
free particles do not move uniformly relative to his surroundings and he may 
again avoid attributing this phenomenon to the fact that his frame is not inertial, 
by supposing certain inertial forces (viz. centrifugal and Coriolis forces) to act 
upon the particles. Now it is an obvious property of each such inertial force that it 
must cause an acceleration which is independent of the mass of the body upon 
which it acts, for the force is always obtained by multiplying the body's mass by an 
acceleration independent of the mass. This property it shares with a gravitational 
force, for this also is proportional to the mass of the particle being attracted and 
hence induces an acceleration which is independent of this mass. This indepen- 
dence of the gravitational acceleration of a particle and its mass has been checked 
experimentally with great accuracy by EtitvOs. If, therefore, we regard the 
equivalence of inertial and gravitational forces as having been established, inertial 
forces can be thought of as arising from the presence of gravitational fields. This is 
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the principle of equivalence. By this principle, in the case of the uniformly 
accelerated rocket, the observer is entitled to neglect his acceleration relative to an 
inertial frame, provided he accepts the existence of a uniform gravitational field of 
intensity - a parallel to the axis of the rocket. Similarly, the observer in the 
rotating rocket may disregard his motion and accept, instead, the existence of a 
gravitational field having such a nature as to account for the centrifugal and 
Coriolis forces. 

By appeal to the principle of equivalence, therefore, an observer employing a 
reference frame in arbitrary motion with respect to an inertial frame, may 
disregard this motion and assume, instead, the existence of a gravitational field. 
The intensity of this field at any point within the frame will be equal to the inertial 
force per unit mass at the point. By this device, every observer becomes entitled to 
treat his reference frame as being at rest and all observers accordingly become 
equivalent. However, the reader is probably still not convinced that the 
distinction between accelerated and inertial frames has been effectively elim- 
inated, but only that it has been concealed by means of a mathematical device 
having no physical significance. Thus, he may point out that the gravitational 
fields which have been introduced to account for the inertial forces are ‘fictitious’ 
fields, which may be completely removed by choosing an inertial frame for 
reference purposes, whereas ‘real’ fields, such as those due to the earth and sun, 
cannot be so removed. He may further object that no physical agency can be held 
responsible for the presence of a ‘fictitious’ field, whereas a ‘real’ field is caused by 
the presence of a massive body. These objections may be met by attributing such 
‘fictitious’ fields to the motions of distant masses within the universe. Thus, if an 
observer within the uniformly accelerated rocket takes himself to be at rest, he 
must accept as an observable fact that all bodies within the universe, including the 
galaxies, possess an additional acceleration of-a relative to him and to this 
motion he will be able to attribute the presence of the uniform gravitational field 
which is affecting his test particles. Again, the whole universe will be in rotation 
about the observer who regards himself and his space-ship as stationary when it is 
in rotation relative to an inertial frame. It is this rotation of the masses of the 
universe which we shall hold responsible for the Coriolis and centrifugal 
gravitational fields within the rocket. But, in addition, these ‘inertial’ gravitational 
fields will account for the motions of the galaxies as observed from the non- 
inertial frame. Thus, for the observer within the uniformly accelerated rocket a 
uniform gravitational field of intensity - a extends over the whole of space and is 
the cause of the acceleration of the galaxies; for the observer within the rotating 
rocket, the resultant of the centrifugal and Coriolis fields acting upon the galaxies 
is just sufficient to account for their accelerations in their circular orbits about 
himself as centre (the reader should verify this, employing the results of Exercises 
1, No. 1). On this view, therefore, inertial frames possess particularly simple 
properties only because of their special relationship to the distribution of mass 
within the universe. In much the same way, the electromagnetic field due to a 
distribution of electric charge lakes an especially simple form when described 



130 


relative to a frame in which all the charges are at rest (assuming such exists). If any 
other frame is employed, the field will be complicated by the presence of a 
magnetic component arising from the motions of the charges. However, this 
magnetic field is not considered imaginary because a frame can be found in which 
it vanishes, whereas for certain magnetic fields such a frame cannot be found. The 
laws of electromagnetism are taken to be valid in all frames, though it is conceded 
that, for solving particular problems, a certain frame may prove to be pre- 
eminently more convenient than any other. Neither, therefore, should the 
centrifugal and Coriolis fields be dismissed as imaginary solely because they can 
be removed by proper choice of a reference frame, although it may be convenient 
to make such a choice of frame when carrying out particular computations. In 
short, the general principle of relativity can be accepted as valid and, at the same 
time, the existence of the inertial frames accounted for by the simplicity of the 
motions of the galactic masses with respect to these particular frames. 

The notion that the existence of inertial frames is bound up with the large-scale 
distribution of matter within the universe is referred to as Mach's principle. 
Although Einstein was powerfully influenced by the principle when developing 
his general theory, he was disappointed to discover that it still permits the 
existence of universes in which local inertial frames are not in uniform non- 
rotatory motion relative to the overall matter distribution. The complete 
integration of Mach’s principle into the theory is yet to be accomplished. 

The previously unexplained identity of inertial and gravitational masses is 
easily deduced as a consequence of the principle of equivalence. For, consider a 
particle of mass m which is being observed from a non-inertial frame. A 
gravitational force equal to the inertial force will be observed to act upon this 
body. This force is directly proportional to the inertial mass m. But, by the 
principle of equivalence, all gravitational forces are of the same nature as this 
particular force and will, accordingly, be directly proportional to the inertial 
masses of the bodies upon which they act. Thus the gravitational ‘charge’ of a 
particle, measuring its susceptibility to the influences of gravitational fields, is 
identical with its inertial mass and the identity of inertial and gravitational masses 
has been explained in a straightforward and convincing manner. 

45. Metric in a gravitational field 

Suppose that a space-station in the shape of a wheel has been constructed in a 
region of space far from other attracting bodies and that it is set rotating in its 
plane about its centre with angular velocity o>. An observer O, wearing a space- 
suit, is located outside the station and does not participate in the rotary motion; 
his frame of reference is therefore inertial. O watches C, a member of the station’s 
crew, measuring the dimensions of the station using a metre rule. C first measures 
the radius of the station from its centre to its outer wall by laying his rule along 
one of the corridors forming a spoke of the wheel. O notes that the rule is moving 
laterally throughout the measuring process, but this motion does not affect its 
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length in his frame and he will accordingly agree with the radius r recorded by C. 
C next lays his rule around the outer wall of the station and records a perimeter p. 
During this process, however, O sees the rule moving longitudinally with velocity 
o>r and its length will be reduced by a factor 1 — a> 2 r 2 /c 2 ). He will accordingly 
correct the length of the perimeter found by C to the value — a> 2 r 2 /c 2 ). 
Since O’s frame is inertial. Euclidean geometry is valid for all space measurements 
referred to the frame and he must find that 

p ^/ (l — ro 2 r 2 /c 2 ) = 2nr (45.1) 

Thus p = 2w(l - w 2 r 2 /c 2 )~ l ' 2 (45.2) 

This last equation indicates that C will discover that the Euclidean formula 
p = 2nr is not valid for measurements made in the rotating frame of the space- 
station. But C is entitled to regard the station frame as being at rest, provided he 
accepts the existence of a gravitational field which will account for the centrifugal 
and Coriolis forces he experiences. We conclude that, relative to a frame at rest in 
such a gravitational field, spatial measurements will not be in conformity with 
Euclidean geometry. 

By the principle of equivalence, the conclusion which has just been reached 
concerning the non-Euclidean nature of space in which there is present a 
gravitational field of the centrifugal-Coriolis type, must be extended to all 
gravitational fields. However, in the case of a field such as that which surrounds 
the earth, it will not be possible (as it is for the centrifugal- Coriolis field) to find an 
inertial frame of reference relative to which the field vanishes and for which the 
spatial geometry is Euclidean. Such a field will be termed irreducible. Even in an 
irreducible field, however, a frame can always be found which is inertial for a 
sufficiently small region of space and a sufficiently small time duration. Thus, 
within a space-ship which is not rotating relative to the extragalactic nebulae and 
which is falling freely in the earth's gravitational field, free particles will follow 
straight-line paths at constant speed for considerable periods of time and the 
conditions will be inertial. A coordinate frame fixed in the ship will accordingly 
simulate an inertial frame over a restricted region of space and time and its 
geometry will be approximately Euclidean. 

Since a rectangular Cartesian coordinate frame can be set up only in a space 
possessing a Euclidean metric, this method of specifying the relative positions of 
events must be abandoned in an irreducible gravitational field (except over small 
regions as has just been explained). Instead, the positions and times of all events 
will be specified by reference to a very general type of frame which we can suppose 
constructed as follows: Imagine the whole of the cosmos is filled by a fluid whose 
motion is arbitrary but non-turbulent (i.e. particles of the fluid which are initially 
close together, remain in proximity to one another). Let each molecule of the fluid 
be a clock which runs smoothly, but not necessarily at a constant rate as judged by 
a standard atomic clock. No attempt will be made to synchronize clocks which are 
separated by a finite distance, but it will be assumed that, as this distance tends to 
zero, the readings of the clocks will always approach one another. Each clock will 
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be allocated three spatial coordinates £ 2 , £ 3 according to any scheme which 
ensures that the coordinates of adjacent clocks only differ infinitesimally. The 
coordinates £,* of a clock will be supposed never to change. Any event taking place 
anywhere in the cosmos can now be allocated unique space-time coordinates 

(,• = 1,2, 3, 4) as follows: (£\ ^ 2 ,4 3 )are the spatial coordinates belonging to the 
dock which happens to be adjacent to the event when it occurs, and £ 4 is the time 
shown on this clock at this instant. 

We shall now further generalize the coordinates allocated to an event. Let 
x‘ (i = 1, 2, 3, 4) be any functions of the such that, to each set of values of the 
there corresponds one set of values of the x', and conversely. We shall write 

x‘ = x‘(V (43.3) 

Then the x‘, also, will be accepted as coordinates, with respect to a new frame of 
reference, of the event whose coordinates were previously taken to be the £'. It 
should be noted that, in general, each of the new coordinates x‘ will depend upon 
both the time and the position of the event, i.e., it will not necessarily be the case 
that three of the coordinates x' are spatial in nature and one is temporal. All 
possible events will now be mapped upon a space so that each event is 
represented by a point of the space and the x' will be the coordinates of this point 
with respect to a coordinate frame. will be referred to as the space-time 
continuum. 

It has been remarked that, in any gravitational field, it is always possible to 
define a frame relative to which the field vanishes over a restricted region and 
which behaves as an inertial frame for events occurring in this region and 
extending over a small interval of time. Such a frame will be falling freely in the 
gravitational field and will accordingly be referred to as a local free-fall frame. 
Suppose, then, that such an inertial frame S is found for two contiguous events. 
Any other frame in uniform motion relative to S will also be inertial for these 
events. Observers at rest in all such frames will be able to construct rectangular 
Cartesian axes and synchronize their standard atomic clocks in the manner 
described in Chapter 1 and hence measure the proper time interval dr between the 
events. If, for one such observer, the events at the points having rectangular 
Cartesian coordinates (x, y, z), (x + dx, y + dy, z + dz) occur at the times t, t + dr 
respectively, then 

dr 2 = dt 2 --^(dx 2 + dy 2 + dz 2 ) (45.4) 

c 

The interval between the events ds will be defined by 

ds 2 = — c 2 dz 2 = dx 2 + dy 2 +dz 2 — c 2 dr 2 (45.5) 

The coordinates (x, y, z, t) of an event in this quasi-inertial frame will be related to 
the coordinates x‘ defined earlier, by equations 

x = x(x l , x 2 , x 3 , x 4 ), 


etc. 


(45.6) 
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and hence dx = dx‘, etc. (45.7) 

dx' 

Substituting for dx, dy, dz, dt in equation (45.5), we obtain the result 

ds 2 = <? ij dx'dx J (45.8) 

determining the interval ds between two events contiguous in space time, relative 
to a general coordinate frame valid for the whole of space -time. The space- time 
continuum can accordingly be treated as a Riemannian space with metric given by 
equation (45.8). 

As explained in section 7, dr can be timelike or spacelike according as it is real 
or imaginary respectively. If it is real, it will be possible for a standard clock to be 
present at both the events x‘, x‘ + dx‘ and the time which elapses between them as 
measured by this clock will be dr. Alternatively, if ds is imaginary, d s/ic can be 
interpreted as the time between two contiguous events as measured by a standard 
clock present at both. 


46. Motion of a free particle in a gravitational field 


In a region of space which is at a great distance from material bodies, rectangular 
Cartesian axes Oxyz can be found constituting an inertial frame. If time is 
measured by clocks synchronized within this frame and moving with it, the 
motion of a freely moving test particle relative to the frame will be uniform. Thus, 
if (x, y, z) is the position of such a particle at time f, its equations of motion can be 
written 


d 2 x d 2 y d 2 z 
dr 2 dt 2 dr 2 


(46.1) 


Let dsbe the interval between the event of the particle arriving at the point (x,y, z) 
at time t and the contiguous event of the particle arriving at (x + dx,y + dy,z +dz) 
at t + dt. Then ds is given by equation (45.5) and, if v is the speed of the particle, it 
follows from this equation that 

ds = (i> 2 — c 2 ) I/2 dt (46.2) 


Since v is constant, it now follows that equations (46.1) can be expressed in the 
form 


d 2 x d 2 y d 2 z 
ds 2 ds 2 ds 2 


(46.3) 


Also, from equation (46.2) it may be deduced that 



(46.4) 


Equations (46.3) and (46.4) determine the family of world-lines of free particles in 
space- time relative to an inertial frame. 
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Now suppose that any other reference frame and procedure for measuring time 
is adopted in this region of space, e.g. a frame which is in uniform rotation with 
respect to an inertial frame might be employed. Let (x\ x 2 , x\ x 4 ) be the 
coordinates of an event in this frame. The interval between two contiguous events 
will then be given by equation (45.8). If an observer using this frame releases a test 
particle and observes its motion relative to the frame, he will note that it is not 
uniform or even rectilinear and will be able to account for this fact by assuming 
the presence of a gravitational field. He will find that the particle’s equations of 
motion are . 


ds 


+ / ,M 

2 + t J‘ I 


dx J dx‘ 
ds ds 


= 0 


(46.5) 


This must be the case for, as shown in section 43, this is a tensor equation defining 
a geodesic and valid in every frame if it is valid in one. But, in the xyzt-frame, the 
g t j are all constant and the three index symbols vanish. Hence, in this frame, the 
equations (46.5) reduce to the equations (46.3) and (46.4) and these are known to 
be true for the particle’s motion. We have shown, therefore, that the effect of a 
gravitational field of the reducible variety upon the motion of a test particle can be 
allowed for when the form taken by the metric tensor g il of the space- time 
manifold is known relative to the frame being employed. This means that the g { j 
determine, and are determined by, the gravitational field. 

The ideas of the previous paragraph will now be extended to regions of space 
where irreducible gravitational fields are present. It has been pointed out that, for 
any sufficiently small region of such space and interval of time, an inertial frame 
can be found and consequently the paths of freely moving particles will be 
governed in such a small region by equations (46.5). It will now be assumed that 
these are the equations of motion of free particles without any restriction, i.e. that 
the world-line of a free particle is a geodesic for the space-time manifold or that 
the world-line of a free particle has constant direction. This appears to be the 
natural generalization of the Galilean law of inertia whereby, even in an 
irreducible gravitational field, a particle's trajectory through space- time is the 
straightest possible after consideration has been given to the intrinsic curvature of 
the continuum. It will then follow that the motions of particles falling freely in 
any gravitational field can be determined relative to any frame when the 
components g tJ of the metric tensor for this frame are known. Thus the g tJ will 
always specify the gravitational field observed to be present in a frame and the 
only distinction between irreducible and reducible fields will be that, for the latter 
it will be possible to find a coordinate frame in space-time for which the metric 
tensor has all its components zero except 

9n = 922 = 933 ~ J 9*i = —c 2 (46.6) 

whereas for the former this will not be possible. 

It will be proved in section 50 that the assumption we are making can be derived 
from Einstein’s law of gravitation and hence does not constitute an additional 
basic hypothesis of the theory. 
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Since the Christoffel symbols vanish in a frame which is geodesic at some point 
of space-time, in such a frame equations (46.5) reduce to d 2 x'/ds 2 = 0 over a 
small neighbourhood of the point. If, in addition, the frame is chosen to be quasi- 
Euclidean with metric (45.5), equations (46.1) will be valid over the neighbour- 
hood and a freely falling body will have very nearly uniform motion. Such a frame 
can therefore be identified with a local freely falling frame. 


47. Einstein’s law of gravitation 


Accord ing to Newtonian ideas, the gravitational field which exists in any region of 
space is determined by the distribution of matter. This suggests that the metric 
tensor of the space-time manifold, which has been shown to be closely related to 
the observed gravitational field, should be calculable when the matter distribution 
throughout space -time is known. We first look, therefore, for a tensor quantity 
describing this matter distribution with respect to any frame in space-time and 
then attempt to relate this to the metric tensor. The energy-momentum tensor T tj , 
defined in section 21 with respect to an inertial frame, immediately suggests itself. 
Both matter and electromagnetic energy contribute to the components of this 
tensor but since, according to the special theory, mass and energy are basically 
identical, it is to be expected that alt forms of energy, including the elec- 
tromagnetic variety, will contribute to the gravitational field. 

Since the energy- momentum tensor has been defined in inertial frames only, 
this definition must now be extended to apply to a general coordinate frame in 
space-time. This can be carried out thus: In the neighbourhood of a point P of 
space- time, a frame with coordinates y' can be defined which is geodesic at P and 
whose metric reduces to the Euclidean form (39. 13) at P. As explained in the last 
section, this frame will correspond to a local freely falling quasi-inertial frame in 
which the y‘ will behave like Minkowski coordinates; we shall assume that the 
equations of the special theory are valid in this frame at P. The transformation 
equations relating the coordinates y' of an event to its coordinates x‘ with respect 
to any other coordinate frame can now be found. Then, if 7//” are the components 
of the energy-momentum tensor in the y-frame at the point P, its components in 
the x-frame at this point can be determined from the appropriate tensor 
transformation equations. Thus, the covariant energy-momentum tensor will 


have components T )j in the x-frame given by 


•' dx‘dx J 


(47.1) 


Since covariant and contravariant tensors are indistinguishable with respect to 
rectangular Cartesian axes, 7V/” can also be taken to be the components of a 
contravariant tensor in the y-frame and the components of this tensor in the x- 
frame will then be given by the equation 


dx/dx/ 
dy r dy s " 


(47.2) 
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Similarly, the components of the mixed energy-momentum tensor are given by 


ex' t'y' (0 , 
1 cy' cx J ” 


(47.3) 


These transformations can be carried out at every point of space-time, thus 
generating for the x-frame an energy-momentum tensor field throughout the 
continuum. It is left as an exercise for the reader to show that the last three 
equations are consistent, i.e. raising the indices of T tJ as given by equation (47.1) 
leads to T' 1 as given by equation (47.2) (see Exercise 1 at the end of this chapter). 
Consider the tensor equation 

rf = 0 (47.4) 

Expressed in terms of the coordinates y 1 at any point of space-time, this simplifies 
to 

Tf,f = 0 (47.5) 

which is equation (21.20). Being valid in one frame, therefore, equation (47.4) is 
true for all frames. Thus, the divergence of the energy-momentum tensor 
vanishes. If, therefore, this tensor is to be related to the metric tensor g ijt the 
relationship should be of such a form that it implies equation (47.4). Now 


g‘{ = 0 (47.6) 

by equation (39.23) and hence, a fortiori , 

<?;;= 0 (47.7) 

The law T“ = /.g" (47.8) 

where A is a universal constant, would accordingly be satisfactory in this respect. 
However, over a region in which matter and energy were absent so that T‘ j = 0, 
this would imply that 

g‘ J = 0 (47.9) 

which is clearly incorrect. Further, according to Newtonian theory, if g is the 
density of matter, the gravitational field can be derived from a potential function 
U satisfying the equation 

V 2 U = 4 nGg (47.10) 

where G is the gravitational constant. The new law of gravitation which is being 
sought must include equation (47.10) as an approximation. But, as appears from 
equation (21.14), Ti 4 involves g and it seems reasonable, therefore, to expect that 
the other member of the equation expressing the new law of gravitation will 
provide terms which can receive an approximate interpretation as V 2 l/. This 
implies that second-order derivatives of the metric tensor components will 
probably be present. We therefore have a requirement for a second rank 
contravariant symmetric tensor involving second-order derivatives of the <7^ and 
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of vanishing divergence to which T‘ J can be assumed proportional. Einstein's 
tensor (42.20) possesses these characteristics and consequently we shall put 

R‘‘ — ?g' j R = —kT ,j (47.11) 

where k is a constant of proportionality which must be related to G and which we 
shall later prove to be positive. Equation (47.11) expresses Einstein's law of 
gravitation; by lowering the indices successively, it may be expressed in the two 
alternative forms 


= -kT'j 

(47.12) 

Rij-hijR = -KTjj 

(47.13) 

If equation (47.12) is contracted, it is found that 


R = kT 

(47.14) 

where T = Tf It now follows that Einstein’s law of gravitation 
expressed in the form 

can also be 

a 

ii 

kJ- 

«a 

I 

JS 

(47.15) 

with two other forms obtained by raising subscripts. 

Since the divergence of g' J vanishes, a possible alternative to the law (47. 1 1 ) is 

R'l-ytR-Ag" = -kT‘ j 

(47.16) 


where A is a constant. The law (47.1 1) gives results which agree with observation 
over regions of space of galactic dimensions, so that it is certain that, even if A is 
not zero, it is exceedingly small. However, the extra term has entered into some 
cosmological investigations (see Chapter 7). 


48. Acceleration of a particle in a weak gravitational Held 

In a gravitational field, such as the one due to the earth, the geometry of space is 
not Euclidean and no truly inertial frame exists. In spite of this, we experience no 
practical difficulty in establishing rectangular Cartesian axes Oxyz at the earth’s 
surface relative to which for all practical purposes the geometry is Euclidean and 
the behaviour of electromagnetic systems is indistinguishable from their 
behaviour in an inertial frame. It must be concluded, therefore, that such a 
gravitational field is comparatively weak and hence that, with respect to such axes 
and their associated clocks, the space-time metric will not differ greatly from that 
given by equation (45.5). Putting 

x 1 = x, x 2 = y, x 3 = z, x 4 = ict, (48.1) 

in terms of the x‘ the metric will be given by 


ds 2 = dx'dx 1 


(48.2) 



approximately. With respect to the x'-frame, it will accordingly be assumed that 

g ij = S ij + h,j (48.3) 

where the S tJ are Kronecker deltas and the h {J are small by comparison. 

Consider a particle moving in a weak gravitational field whose metric tensor is 
given by equation (48.3). The contravariant metric tensor will be given by an 
equation of the form 

g‘ J = V'+k 11 (48.4) 

where the k' 1 are of the same order of smallness as the /i... Then, since 


ch,> 

dx k 


(48.5) 


(48.6) 


r I, k ] = 1 ( — + ~ ^ 

L J ’ J 2 \ dx' + dx j 5x k J 
it follows that, to a first approximation, 

The equations of motion of the particle can now be written down as at (46.5). 
By equation (46.2), 

tt = = ,v ’ lc * (48-7) 

ds dt ds 

where v is the particle’s velocity in the quasi-inertial frame. Hence, if the particle is 
stationary in the frame at the instant under consideration. 


dx' 

-r = («.!) 

ds 

and the equations of motion (46.5) reduce to the form 

dV . : 


ds 2 


0 


(48.8) 


(48.9) 


correct to the first order in the h tJ . Substituting from equation (48.6), this is seen to 
be equivalent to 


ds 


Differentiating equation (48.7) with respect to s and making use of equation 
(46.2), we obtain 

= (v 2 -c 2 y ] — c 2 )‘ 2 (v,ic) (48.11) 


and, when v = 0, this reduces to 
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From equations (48.10) and (48.12), we deduce that the components of the 
acceleration of the stationary particle in the directions of the rectangular axes are 


/ 1^44 c/i 4 , \ 

\2 cx‘ cx* ) 


(48.13) 


for i = I, 2, 3. Reverting to the original coordinates (x, y, z, t), these components 
are written 


2 /lc/i44 i r/i 41 \ 
C (,2 dx c dt )' 


etc 


(48.14) 


Hence, if the field does not vary with the time, the acceleration vector is 

- grad (|c 2 (144) (48.15) 

But, if V is the Newtonian potential function for the field, this acceleration will be 
— grad V. It follows that, for a weak field, a Newtonian scalar potential V exists 
and is related to the space-time metric by the equation 

U=$c 2 h ii (48.16) 

Alternatively, we can write 

2U 

.944 = 1 +— (48.17) 

c z 


49. Newton’s law of gravitation 


In this section it will be shown that Newton’s law of gravitation may be deduced 
from Einstein's law in the normal case when the gravitational field's intensity is 
weak and the matter distribution is static. 

First consider the form taken by the Riemann-Christofiel tensor in the 
space-time of a weak field. In the x'-frame, the metric tensor is given by equation 
(48.3) and the Christoffel three-index symbols by equation (48.6). If products of 
the hij are to be neglected, equation (36.21) shows that 



c 

i \ 1 I \ 

jli ~ J \jk) 


(49.1) 


approximately. Hence the Ricci tensor is given by 


c , t 



A k i J i 


■ ! A 1 

cx 

CX 


.ill 

( ' h J. 

_ 

2 cx k { 

dx 1 

cx 1 

1 c 


ch ki 

2dx 

1 [ox‘ 

dx 1 


chi 


cx 
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= 1 { g2/i » + _ d2f, ‘j _ 1 

iljcxG'x' 1 cx'3x‘ cx‘cx k ex' ex' J 
In particular, putting j = k = 4, we find that 

n 1 \ d 2 h it _ 3 2 /i 44 t 1 

^44 — 4 a 4' a i ^ j ^ i ^ 4 f 

2[cx OX ox ox ox ox ] 


(49.2) 


(49.3) 


If the matter distribution is static in the quasi-inertial frame being employed, the 
h,j will be independent of i and equation (49.3) reduces to 

*44 = 1^44 < 49 - 4 ) 

where V 2 = c 2 /dx 2 +v 2 /dy 2 +d 2 /dz 2 . If U is the Newtonian potential for the 
field, equation (48.16) now shows that 


R 


44 


v 2 c 

e 


(49.5) 


Assuming that no electromagnetic field is present and that the contribution to 
the energy-momentum tensor of any stress forces within the matter distribution 
responsible for the gravitational field is negligible, T it will be determined by 
equation (21.16). But, since the distribution is static, its 4-velocity of flow V at 
every point is (0, ic) and hence all components of 7',,, with the exception of are 
zero. In this case, 

= -c 2 fi 0 o (49.6) 

where n 00 , for zero velocity of matter, is the ordinary mass density. Also 

T = T; = T ti = T ia ,= - c 2 [i 00 (49.7) 

The 44-component of Einstein’s gravitation law in the form of equation (47.15) 
can now be expressed approximately 


or 


-T n 00 

c 

v 2 v = JKC-Voo 


(49.8) 


This is the Poisson equation (47.10) of classical Newtonian theory, provided we 
accept 


8rtG 


(49.9) 


This specifies k in terms of the gravitational constant. 


50. Freely falling dust cloud 

Consider the case of a cloud of particles falling freely in the field of the cloud itself, 
there being no other forces present in the system. The energy-momentum tensor 
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for such an incoherent cloud has been calculated relative to an inertial frame as 
equation (21.16). This equation will be taken to provide a definition of T tj in the 
freely falling frame at any point of the cloud. 

In an arbitrary x-frame, let x 1 = x‘(x) be parametric equations of the world-line 
of some particle of the cloud, r being the proper time measured by a standard 
clock moving with the particle. Then the 4- velocity of flow of the particle at time r 
is defined by the equation 


V‘ = 


dx' 

dr 



(50.1) 


where sis the interval parameter measured along the world-line. The square of the 
magnitude of the 4-velocity of flow is 

= 9ijV i V i = = -c 2 (50.2) 

This equation can also be written 

VjV‘ = -c 2 (50.3) 

Now consider the tensor equation 

T' s = » 00 V V‘ (50.4) 

where /r 00 is the mass density as measured in a freely falling frame moving with 
the cloud; <s clearly a 4-invariant. In any freely falling frame, this equation 
reduces to equation (21. 16) and is accordingly valid; this establishes its validity in 
all frames. 

Equation (47.4) is known to follow from Einstein's equation of gravitation and, 
in this case, takes the form 

(^00 v‘ y% = (voo n, v + noo y j = o (50.5) 

Multiplication by now gives 

Utooy%V , K + lhoV l . l ViV i -0 (50.6) 

Differentiating equation (50.3) with respect to x J , we find 

V, / V'+ Vi V\ = 0 (50.7) 

Raising and lowering the index i in the two factors of the first term, this equation 
is seen to be equivalent to 

r V, = 0 (50.8) 

Equation (50.6) accordingly reduces to 

(^oo y% = 0 (50.9) 


r V> = 0 

• J 


Equation (50.5) now gives 


(50.10) 
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or 

(0V‘ ,\ 



( f v + r ^T =0 

(50.11) 

Hence 

f l l / 'dx J dx‘dx J 

(50.12) 


r.x J dr ~ kt dt dt 

or 

d 2 x' dx‘dx j 



d? + n '7 7 ~ 0 

(50.13) 

This can also be written 

d 2 x' dx*dx J 



d7 + ri2 7 7=° 

(50.14) 


a result which proves that the world-lines of the particles of the cloud arc 
geodesics. 

That the world-lines of freely falling particles in general circumstances are 
geodesics can be derived from equation (47.4), proving that Einstein's law 
includes its own law of motion for a particle in a gravitational field. 


51. Metrics with spherical symmetry 

When a change is made in the space -time coordinate frame from coordinates x 1 to 
coordinates x\ the metric tensor g tJ will change to g tJ by the law of transformation 
of a covariant tensor. In general, the g tj will be functions of the x‘ and the g tj will 
be functions of the x\ but it will not usually be the case that the g {J are the same 
functions of the 'barred' coordinates that the g tJ are of the 'unbarred' coordinates, 
i.e., the functions ^^(x* ) are not form invariant under general coordinate 
transformations. However, in some special cases, it is possible for these functions 
to be form invariant under a whole group of transformations, and we shall study 
such a case in this section. 

In a gravitational field, the geometry can only be quasi-Euclidean and 
consequently rectangular Cartesian axes do not exist. Nevertheless, no difficulty 
is experienced in practice in defining such axes approximately and we shall 
suppose, therefore, that the coordinates x, y, z, t of an event in the gravitational 
field about to be considered are interpreted physically as rectangular Cartesian 
coordinates and time. We shall now search for a metric which, when expressed in 
those coordinates, is form invariant with respect to the group of coordinate 
transformations which will be interpreted physically as rotations of the 
rectangular axes Oxyz (f is to remain unaltered). To be precise, it will be supposed 
that spatial coordinates (x, y, z)can be defined such that the metric y,j(x, >•, z, t) is 
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form invariant under the group of orthogonal transformations x = Ax, where 
x = (x, y, z) r , x = (x, y, z) T and AA r = I. Such a metric will be said to be 
spherically symmetric about O. 

Invariants for this group of coordinate transformations, which are of degree no 
higher than the second in the coordinate differentials dx, d>\ dz, are 

x 2 + v 2 + z 2 , xdx + ydy + zdz, dx 2 + d> 2 + dz 2 (51.1) 

Introducing spherical polar coordinates (r, 0, $), which will be defined by the 
equations (30.3), these invariants may be written 

r 2 , rdr, dr 2 + r 2 d0 2 +r 2 sin 2 0d$ 2 . (51.2) 

It follows that r, dr, d0 2 + sin 2 0dd> 2 (51.3) 


are invariants. The most general metric with spherical symmetry can now be built 
up in the form 

ds 2 = A(r,t)dr 2 + B(r, t) (dd 2 + sin 2 0d0 2 ) 

+ C(r,t)drdt + D(r,t)dt 2 (51.4) 


We now replace r by a new coordinate r' according to the transformation 

r' 2 = 8(r, t) (51.5) 

ds 2 = £(r',f)dr' 2 + r' 2 (d0 2 + sin 2 0d0 2 ) 

+ F(r', f Jdr'df +G(r', r ) dr 2 (51.6) 


equation 

Then 


In a truly inertial frame, spherical polar coordinates can be defined exactly and the 
metric will, by equation (45.5), be expressed in the form 

ds 2 =dr 2 +r 2 (d0 2 +sin 2 0d0 2 )-c 2 dt 2 (51.7) 


Comparing equations (51.6) and (51.7), it is clear that in a region for which (51.6) 
is the metric, r' will behave approximately like a true spherical polar coordinate r. 
We shall accordingly drop the primes and write 

ds 2 = E(r, I ) dr 2 +r 2 (d0 2 + sin 2 0d0 2 ) 

+ F(r,t)drdf + G(r,r)df 2 (51.8) 

If our frame is quasi-inertial, equation (51.7) must be an approximation for 
equation (51.8) and the following equations must therefore be true 
approximately: 

E(r,t)= 1, F (r, t ) = 0, G{r,t)=-c 2 (51.9) 

Consider now the special case when the gravitational field is static in the quasi- 
inertial frame for which (r, 0, <p) are approximate spherical polar coordinates and t 
is the time. The functions E, F, G will then be independent of t. Also, space-time 
will be symmetric as regards past and future senses of the time variable and this 
implies that ds 2 is unaltered when dt is replaced by - dt. Thus F = 0 and we have 

ds 2 = «dr 2 + r 2 (d0 2 + sin 2 0d0 2 ) — he 2 dt 2 (51.10) 
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where a, b are functions of r both approximating unity in a weak field. 

At any fixed instant t, the metric of space in the presence of this gravitational 
field can be obtained from the last equation by putting dt = 0. Thus, it is 

ds 2 = adr 2 + r 2 (d0 2 + sin 2 0d<£ 2 ) (51.1 1) 

Consider the ‘circle’ r = r 0 in the ‘plane’ 9 = jtt. The length of the element of the 
circle with end points (r 0 , (j> ), (r 0 , <p +d <j>\ has length ds = r 0 d(j>. Thus the total 
length of the circle is 2 nr 0 . However, r 0 will not be the length of a radius <p = <p 0 of 
this circle for, if an element of such a radius has end points (r, <t> 0 ), (r +dr, <p 0 ), the 
length of the element is ds = a ] 2 dr and the total length of the radius is 
accordingly 

f '<> 

a 1 ' 2 dr (51.12) 

J o 

Clearly, the Euclidean formula for the circumference of a circle of given radius 
does not apply. 

For the metric (51.10), taking 

x 1 = r, x 2 = 6, x 3 = 0, x* = t (51.13) 

we have 

922 =r 2 , g i} = r 2 sin 2 0, g^^-bc 2 (51.14) 

all other g tj being zero. Thus 

g=-abc 2 r* sin 2 0 (51.15) 

and hence 


9 


1 1 _ 


1 

a’ 



2 sin 2 0’ ® 



(51.16) 


all other g“ being zero. The three-index symbols can now be calculated and, 
putting a = b = e f , those which do not vanish are listed below: 

A 


1 

1 1 

2 

1 2 

3 

1 3 

4 

1 4 
1 

2 2 




2 

2 1 


= l/i" 

1 ir 


-{.’.I- 


{,'4 




(51.17) 
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3 

2 3 

1 

3 3 
2 

3 3 
1 

4 4 


= cot 0 


3 2 ' 

= - re~ a sin 2 6 

= -sinflcosf) 


= *c 2 /?V-‘ 


primes denoting differentiations with respect to r. 

The non-zero components of the Ricci tensor are now calculated to be: 


Kn = \P" + iP ' 2 -*«'/*'- **' 

*22 -e-*(irP’-ir«'+l)-l 

Rj 3 = R 22 sin 2 0 

R 44 = cV"( - l iP"-iP' 2 +ia'p- - V) 


(51.18) 


52. Schwarzschild’s solution 

The static, spherically symmetrical metric (51.10) will determine the gravitational 
field of a static distribution of matter also having spherical symmetry, provided it 
satisfies Einstein’s equations (47.15). We shall consider the special case when the 
whole of space is devoid of matter, apart from a spherical body with its centre at 
the centre of symmetry O. Then T tJ = 0, T = 0 at all points outside the body and 


Einstein’s equations reduce 

in this region to 



= 0 

(52.1) 

By equations (51.18), these 

are satisfied by the metric (51.10), provided 


P‘ 

" + jP' 2 - {a P' - -a' =0 
r 

(52.2) 


\rP' - + 1 = e 1 

(52.3) 

P' 

'+}P ' 2 -ix’P‘ + 2 p' = 0 

r 

(52.4) 

Subtracting equation (52.2) 

from (52.4), it follows that 



a + P = constant 

(52.5) 


But, as r -> 00 , we shall assume that our metric approaches that given by equation 
(51.7), valid in the absence of a gravitational field. Thus, at infinity, a = = Oand 
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hence 

a + 0 = 0 (52.6) 

Eliminating p from equation (52.3). it will be found that 

r'x=\-e I (52.7) 

The variables are separable and this equation is easily integrated to yield 

a = e* = (1 — 2m/r)“ 1 (52.8) 

where m is a constant of integration. Then 

b = e t =\- 2m/ r (52.9) 

and it may be verified that each of the equations (52.2)— (52.4) is satisfied by these 
expressions for a and p. 

We have accordingly arrived at a metric 

ds 2 = - — |-r 2 ( dO 2 + sin 2 0d <p 2 )-c 2 ( 1 — — dr 2 (52.10) 

1 - (2m/r) V r ) 


which is spherically symmetrical and can represent the gravitational field outside 
a spherical body with its centre at the pole of spherical polar coordinates (r, 9, </>). 
This was first obtained by Schwarzschild. It will be proved in the next section that 
the constant m is proportional to the mass of the body. This may also be deduced 
from equation (48.17), for the potential V at a distance r from a spherical body of 
mass M is given by 



CM 

r 

(52.11) 

and hence 

, 2 GM 

044 1 2 

(52.12) 


Now g 44 is the coefficient of (dx 4 ) 2 = -c 2 dr 2 in the metric and hence 


b = 


2 GM 


(52.13) 


Comparing equations (52.9) and (52.13), it will be seen that 

CM 


(52.14) 


It is clear from equation (52.10) that the metric is not valid for r = 2m 
= 2 GM/c 2 . This is the Schwarzschild radius. In SI units, c = 3 x 10 8 and for the 
earth GM = 3.991 x 10 14 , so that the radius for this body is about 9 mm; since the 
metric is only applicable in the region outside the earth, no difficulty is 
encountered in this case. However, for an exceptionally dense body, the radius 
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may extend into the surrounding space and, for values of r less than the radius, the 
metric needs special consideration (see section 57). 


53. Planetary orbits 

The attractions of the planets upon the sun cause this body to have a small 
acceleration relative to an inertial frame. If, therefore, a coordinate frame moving 
with the sun is constructed, relative to this frame there will be a gravitational field 
corresponding to this acceleration in addition to that of the sun and planets. 
However, for the purpose of the following analysis, this field and the fields of the 
planets will be neglected. Thus, relative to spherical polar coordinates having 
their pole at the centre of the sun, the gravitational field will be assumed 
determined by the Schwarzschild metric (52.10). The planets will be treated as 
particles possessing negligible gravitational fields, whose world-lines are 
geodesics in space-time. We proceed to calculate these geodesics. 

Since the intervals between adjacent points on the world-line of a particle arc 
necessarily timelike, s will be purely imaginary along such a curve. When 
calculating geodesics it is usually more convenient, therefore, to replace s by r and 
to work from the metric expression for dr 2 . Thus, in this section, the 
Schwarzschild metric will be taken in the form 

I / dr 2 \ 

dr 2 = — = (- — — - + r 2 (d0 2 + sin 2 Odcp 2 ) ) + (1 - 2m/r)di 2 (53.1) 
r \1 - 2 m/r ] 

and equations (43.19) for the geodesies become 


d 

dr 


( r dr\ m /dr\ 2 /dfA 2 
\r - 2m dry + (r -2m) 2 \drj \dr/ 




= 0 


d 

dr 



sin 6 cos 0 



= 0 


d^ 

dr 



= 0 


(53.2) 

(53.3) 

(53.4) 


d fr — 2m dt\ 
dr\ r dr/ 


= 0 


(53.5) 


The first of this set of equations will be replaced by the first integral (43.6), viz. 


r - 2m\dt 


+ r‘ 


fdf)V 2 

( — ) + stn 2 6 
\dr J ' 


dip'' 2 
dr 


„ . dt V 


(53.61 
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We now choose the spherical polar coordinates so that the planet is moving 
initially in the plane 9 = Then dO/dr = 0 initially and hence, by equation 
(53.3), d 2 0/dr 2 = Oat this instant. By repeated differentiation of this equation and 
substitution of initial values, it is found that all derivatives of 9 vanish initially. 
Hence, by Maclaurin’s theorem, 0 = {n for all values of r, proving that the planet 
continues to move in the ‘plane' 0 = indefinitely. 

Integrating equations (53.4) and (53.5), and putting 0 = jti, we get 


d (j> h 

dr r 2 
dt kr 
dr r — 2m 


(53.7) 

(53.8) 


where h and k are constants of integration. 

Substituting for d^/dr, dt/drfrom the last two equations and putting 6 = jTtin 
equation (53.6), it follows that 



+ -j (r -2m) = c 2 (k 2 — 1) + 


2mc 2 


(53.9) 


Then, eliminating dr between this equation and equation (53.7), we obtain the 
equation for the orbit, viz. 


h_ dr 
r 2 d(p 


2 me 2 2mh 2 


+ ~2 = c 2 (k 2 -1) + + 


With u = 1/r, this reduces to the form 


du ' i2 
d 4> 


+» 2 = r2ik 2 


, 2mc 2 , , 

1) + — x u + 2 mu 3 
h 


(53.10) 


(53.11) 


Differentiating through with respect to tp, this equation takes a form which is 
familiar in the theory of orbits, viz. 


d 2 u me 2 , , 

w + u = l? + 


(53.12) 


The corresponding equation governing the orbit according to classical 
mechanics is 


d 2 u CM 

d <£ 2 + U h 2 


(53.13) 


where M is the mass of the attracting body and h is the constant velocity moment 
of the planet about the centre of attraction, i.e. 


d 0 
"dT 


h 


(53.14) 
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If we identify the time variable t of classical theory with the proper time r in the 
relativistic theory, equations (53.7) and (53.14) become identical and our choice of 
h for the constant in equation (53.7) is justified. Also, provided we take 


CM 


m = — 
r 


2 


(53.15) 


(confirming equation (52.14)), equation (53.12) corresponds to the classical 
equation (53. 1 3), although there is now an additional term 3 mu 2 . The ratio of the 
additional term 3 mu 2 to the ‘inverse square law’ term mc 2 /h 2 is 


3 h 2 u 2 



(53.16) 


by equation (53.14). r<j) is the transverse component of the planet’s velocity and, 
for the planets of the solar system, takes its largest value in the case of Mercury, 
viz. 4.8 x I0 4 m/s. Since c = 3x 10 s m/s, the ratio of the terms is in this case 7 • 7 
x 10 s , which is very small. However, the effect of the additional term proves to 
be cumulative, as will now be proved, and for this reason an observational check 
can be made. 

The solution of the classical equation (53.13), viz. 


u ~ Ti 1 1 +<? c °s(0 - «)} (53.17) 

h 

where n = GM = me 2 , e is the eccentricity of the orbit and w is the longitude of 
perihelion, will be an approximate, though highly accurate, solution of equation 

(53.12) . Hence the error involved in taking 

3mu 2 = — pr~{l +ecos(tp -«)} 2 (53.18) 

h 

will be absolutely inappreciable, since this term is very small in any case. Equation 

(53.12) can accordingly be replaced by 

+ +ecos(0-d>)} 2 (53.19) 

This equation will possess a solution of the form (53.17) with additional 
‘particular integral’ terms corresponding to the new term (53.18). These prove to 
be as follows: 

-£-{l + jc 2 -ge 2 cos 2(<t> -to) + e<p sin (0 -<u)} (53.20) 

The constant term cannot be observationally separated from that already 
occurring in equation (53.17). The term in cos2 (</> — d») has amplitude too small 
for detection. However, the remaining term has an amplitude which increases 
with $and its effect is accordingly cumulative. Addingthis to the solution (53.17), 
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we obtain 


r- I , 3 m ne 

u = -v < 1 +e cos(<p — H — -z— <p sin (<p — <w) 


= ^2 {1 +ecos(<p — d>- 5a>)} 


(53.21) 


where 6u> = 3mjU$//i 2 and we have neglected terms 0(5tb 2 ). 

Equation (53.21) indicates that the longitude of perihelion should steadily 
increase according to the equation 


c, 3 mfj 3/i 2 3 ft 

5c0 = -p-' P = ^ <P = Sl ,P 


(53.22) 


where / = h 2 /n is the semi-latus rectum of the orbit. Taking ju = 1 • 33 x 10 2 ° SI 
units for the sun, c = 3 x 10 8 and / = 5 • 79 x 10 10 for Mercury, it will be found 
that the predicted angular advance of perihelion per century for this planet's orbit 
is 43". This is in agreement with the observed value. The advances predicted for 
the other planets are too small to be observable at the present time. 


54. Gravitational deflection of a light ray 

In section 7 it was shown that the proper time interval between the transmission 
of a light signal and its reception at a distant point is zero. It was there assumed 
that the signal was being propagated in an inertial frame and hence that no 
gravitational field was present. This result can be expressed by saying that 

d.s = 0 (54.1) 


for any two neighbouring points on the world-line of a light signal. Now, null 
geodesics in the space-time having metric (45.5) are defined by equation (54.1) 
and the equations 

rl ^ v i- rl ^ 

(54.2) 


d A 2 


d 2 )- 

dA 2 


d 2 / 

dA 2 


d 2 t 

d7 2 


for the three index symbols are all zero. Equations (54.2) imply that along a null 
geodesic x, y, z are linearly dependent upon i. But this is certainly true for the 
coordinates of a light signal being propagated in an inertial frame. We conclude 
that the world-lines of light signals are null geodesics in space-time, in this case. 

Since an inertial frame can always be found for a sufficiently small space-time 
region even in the presence of a gravitational field, it follows that the world-line of 
a light signal in any such region is a null geodesic. We shall accept the obvious 
generalization of this result, viz. that the world-lines of light signals over an 
unlimited region of space-time are null geodesics. 

We shall now employ this principle to calculate the path of a light ray in the 
gravitational field of a spherical body. Taking the space-time metric in the 
Schwarzschild form (equation (52.10)), the equations governing a null geodesic are 
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identical with the equations (53.2>— (53. 5) after t has been replaced by A. The first 
integral (43.13) takes the form 


r 

r -2m 



+ r 





( r - 2m) 



= 0 


(54.3) 

Without loss of generality, we shall again put 9 = jtt, so that a ray in the 
equatorial plane is being considered and then proceed exactly as in the last section 
to derive the equation 


d 2 u 

— — s + u — 3 mu 2 
d <p 


(54.4) 


where u = 1/r. This equation determines the family of light rays in the equatorial 
plane. 

As a first approximation to the solution of equation (54.4), we shall neglect the 
right-hand member. Then 

1 

u = - cos(<p + a) (54.5) 

R 


where R, aare constants of integration. This is the polar equation of a straight line 
whose perpendicular distance from the centre of attraction is R. As might have 
been expected, therefore, provided the gravitational field is not too intense, the 
light rays will be straight lines. This deduction is, of course, confirmed by 
observation. Thus, as the moon’s motion causes its disc to approach the position 
of a star on the celestial sphere and ultimately to occult this body, no appreciable 
deflection of the position of the star on the celestial sphere can be detected. 

Again, without loss of generality, we shall put a = 0 so that the light ray, 
as given by equation (54.5), is parallel to the >’-axis (0 = ±{n). Then, putting 
u = cos (p/R in the right-hand member of equation (54.4), this becomes 


d 2 u 3m , 
— t + „, w co, * 


(54.6) 


The additional ‘particular integral’ term is now found to be 


ry (2 - cos 2 ({>) (54.7) 

K 

and hence the second approximation to the polar equation of the light ray is 

1 tYl 

u=— cos<£-i — 2 (2 -cos 2 <p) (54.8) 

R R 

At each end of the ray u = 0 and hence 

m 2 a* - 2m 

— cos (p - cos (p — — = 0 

R R 


(54.9) 
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Assuming m/R to be small, this quadratic equation has a small root and a large 
root. The small root is approximately 


and hence 



(54.10) 

(54.11) 


at the two ends of the ray. The angular deflection in the ray caused by its passage 
through the gravitational field is accordingly 


4m 

~R 


(54.12) 


approximately. 

For a light ray grazing the sun’s surface, 

R = sun’s radius = 6-95 x 10 8 m and m = 1-5 x 10 3 m 

Thus the predicted deflection is 8-62 x 10~ 6 radians, or about 1-77". This 
prediction has been checked by observing a star close to the sun's disc during a 
total eclipse. The experimental findings are in accord with the theoretical result. 


55. Gravitational displacement of spectral lines 

A standard clock will be taken to be any device which experiences a periodic 
motion, each cycle of which is indistinguishable from every other cycle. The 
passage of time between two events which occur in the neighbourhood of the 
clock is then measured by the number of cycles and fraction of a cycle which the 
device completes between these two instants. The clocks employed to determine 
the time coordinate £ 4 of an event in section 45 were not, necessarily, standard 
clocks. Such coordinate clocks can have arbitrary variable rates, the only 
requirement being that, if A, B are two events in the vicinity of a coordinate clock 
and B occurs after A, then the coordinate-time for B must be greater than the 
coordinate-time for A. 

The successive oscillations of atoms governing the motion of a modern atomic 
clock are indistinguishable from one another and it has been assumed that such a 
clock is being used whenever standard time is measured. The constancy of the rate 
of this fundamental physical process is not susceptible to experimental check, 
since it is the standard against which all other rates (e.g. the rate of rotation of the 
earth) are measured. By international agreement, one second is the time which 
elapses when a specific type of atomic system performs a specific number of 
oscillations and this definition applies in all regions of the cosmos and at all 
epochs; this fact should be borne in mind when phrases such as ‘the first second 
after the big bang' occur in cosmological studies. 
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As explained in section 45, if x', x‘ + dx' are the space- time coordinates of two 
adjacent events, then dr = ds/ic is the time separating the events as measured by a 
standard clock which is present at both events. It is assumed that the interval 
between the events is timelike (i.e. dr real) and that the clock is in a state of free fall 
during its passage from one event to the other; alternatively, if the clock is not 
freely falling, it is assumed that any effect on its rate of the gravitational field it 
experiences is corrected for. 

Let x 1 (i = 1, 2, 3,4) be the coordinates of an event with respect to some 
space-time reference frame, x‘,x 2 , x 3 being interpreted physically as spatial 
coordinates relative to a static frame and x 4 /ic as time. If a standard clock is at rest 
relative to this frame, for adjacent points on its world-line dx' = dx 2 = dx 3 = 0 
and hence 

dr 2 = - ds 2 /c 2 = —<? 4 4 (dx 4 ) 2 /c 2 = 0 44 dt 2 (55.1) 


where we have put x 4 = id. The time t measured by the standard clock is 
therefore related to the time t shown on the coordinate clock at (x ‘, x 2 , x 3 ) by the 
equation 



(55.2) 


In the special case of the coordinate frame employed in section 48 which was 
stationary in a relatively weak static gravitational field, it was proved that g 4a is 
given in terms of the Newtonian scalar potential V for the field by the 
approximate equation (48.17). Thus 


dr 



(55.3) 


relates time intervals measured by a stationary standard clock and a coordinate 
clock at a point in a gravitational field where the potential is U. Now, when it is 
emitting its characteristic spectrum, an atom is operating as a standard clock. 
Consider, therefore, an atom for which the period (from standard tables) of one 
complete cycle of radiation corresponding to a certain spectral line is r. If such an 
atom is stationary in the frame at a point P where the potential is 1) „ the time for 
one complete cycle of the radiation as measured by a standard clock at the point 
will be t and the coordinate-time for the cycle will be t, where 

T= 7(1+ 21/,/c 2 ) t (55.4) 

Suppose this radiation is received at another fixed point Q where the potential is 
U 2 - Let Tbe the difference between the coordinate-time of emission of light from 
P and the coordinate-time of its reception at Q; since the gravitational field is 
being assumed static and P, Q are fixed, T will be a fixed constant. Thus, if 
successive crests of the light wave are emitted from P at coordinate-times t 0 , 
! 0 + 1 , these crests will be received at Q at coordinate-times t 0 +T,t 0 +T+t. It 
follows that the period of the radiation as measured by the coordinate clock at Q 
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will also be r. However, a standard clock at Q will measure the period to be r', 
where 

r'=7(l+2t I 2 /c 2 )i (55.5) 

Hence, if v is the standard frequency of the spectral line being observed and v' is 
the observed frequency of the line at Q, then 



In particular, if t/ 1 < U 2 , the observed light will have its frequency shifted 
towards the red. 

In the case of an atom on the surface of the sun observed from a point on the 
earth’s surface, it will be found that, in SI units, 

(7, = - 1.914 x I0 lt , U 2 = -9.512 x 10 8 
and thus v' = 0.9999979 v (55.7) 

This effect is so small that it is very difficult to measure. However, in the case of the 
companion of Sirius, the predicted effect is 30 times larger and has been 
confirmed by observation 


56. Maxwell's eqnations in a gravitational field 

Over any sufficiently small region of space and restricted interval of time it is 
possible to define a rectangular Cartesian inertial frame, i.e. the frame in ‘free fall’ 
in the gravitational field. If the electric and magnetic components of the 
electromagnetic field are measured in this frame, the field tensor F tj defined by 
equation (26.5) can be found. Employing the appropriate transformation 
equations, the components of this tensor relative to general coordinates x 1 in the 
gravitational field can be computed. No distinction is made between covariant 
and contravariant properties relative to the original inertial frame so that, when 
transforming, F {J may be treated as a covariant, contravariant or mixed tensor. If 
it is treated as a covariant tensor, the covariant components in the general x 1 - 
frame will be generated. If it is treated as a contravariant or as a mixed tensor, the 
contravariant or mixed components F'\ F) respectively will be generated. In this 
way, the field tensor is defined at every point of space-time. Similarly, a current- 
density vector with covariant components J ( and contravariant components J‘ is 
defined relative to the x‘-frame. 

Consider the equations 

F'* = Ho J‘ (56.1) 

'\„. + ^, + 'V, = 0 (56.2) 

These are tensor equations and hence are valid in every space-time frame if they 
are valid in any one. But, relative to the inertial coordinate frame (x, >\ z, ict ) 
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which can be found for any sufficiently small space- time region, these equations 
reduce to equations (26. 1 1 ) and hence are valid over such a region. Regarding the 
whole of space-time as an aggregate of such small elements, it follows that 
equations (56.1), (56.2) are universally true. 

Since F‘‘ is skew-symmetric. 


F ‘j 

■J 




t F' 1 1 c 

dx j + -g)cx' 




i e 

s /(-g) ex 1 




(56.3) 


by equation (42.5) (g has been replaced by - y, since g is always negative for a real 
gravitational field). Equation (56.1) is accordingly equivalent to 


1 

\/(-g) 


{ V f — g)F‘‘} — FoJ’ 


(56.4) 


Also, in view of the skew-symmetry of the field tensor, it follows that equation 
(56.2) is equivalent to 


cx cx CX J 


(56.5) 


The energy- momentum tensor for the field is found from equation (29.5) to be 
given by 

Ho S) = F ik F jk -iS‘ j F k ‘F kl (56.6) 

It now follows from Maxwell’s equations that 

S[,= - Fjj J j = ~ D, (56.7) 

where £>, is the 4-force density acting upon the charge distribution. 


57. Black holes 

The Schwarzschild metric is only valid in the region outside a spherically 
symmetric attracting body. Thus, if the radius of this body exceeds 2m, the 
circumstance that the component g, , of the metric tensor becomes infinite at 
r = 2m creates no difficulty. If, however, the body’s radius is less than 2m, the 
sphere r = 2m lies in empty space and the nature of the field in the vicinity of this 
sphere needs careful study. 

Although the metric is clearly invalid over the sphere r = 2m, it is an acceptable 
solution of the Einstein equation in the region 0 < r < 2m. Consider a body 
moving radially in this region, not necessarily in a state of free fall. Then 9 and <p 
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will both be constant and the metric equation reduces to 

c 2 dr 2 = a~ 1 (dr 2 — c 2 a 2 dr 2 ) (57.1) 

where x = 2mjr - 1 > 0, along the body's world-line. Given the equation of 
motion r = r(t), this equation determines the proper time r shown on a standard 
clock moving with the body. But dr must be real and it follows that either (i) dr/dr 
> cx or (ii) dr/dr < — cx. These inequalities show that it is impossible for a body 
to be stationary relative to our coordinate frame in this region. This implies that 
our picture of the frame as a set of coordinate clocks measuring the time t and 
stationary at the points (r, 0, $) ceases to be applicable. Evidently, the static 
conditions we have been envisaging in the neighbourhood of the attracting body 
are not present in this region. 

Next, consider a body falling freely along a radius towards the centre of 
attraction in the region r > 2m. Taking as initial conditions t = 0, r = R, 
dr/dt = 0, equations (53.5) and (53.6) lead to the equation of motion 


Thus 



(57.2) 

(57.3) 


and it is clear that this integral diverges to + x as r -* 2m. This means that, in the 
Schwarzschild frame, the body will need an infinite coordinate time to reach the 
sphere r - 2m. If the body is observed optically by an observer stationed at a 
considerable distance from the centre of attraction, since allowance must be made 
for the coordinate time needed for photons leaving the body to reach his 
telescope, the observed motion of the body, as measured by his coordinate clock, 
will be further retarded But his coordinate clock will be almost indistinguishable 
from a standard clock and it follows that the apparent time of fall of the body to 
the Schwarzschild radius according to an external observer using an atomic clock 
will also be infinite. 

If, however, instead of eliminating the proper time x between equations (53.5) 
and (53.6), the coordinate time i is eliminated, the resulting equation is 



After integration with r = 0 at r = R, this gives 


cr = ^ (R 3 /2m) [ V(p-p 2 ) + £cos -1 (2p- 1)] (57.5) 

where p = rjR and the inverse cosine is taken in the first or second quadrants, r 
will be the time recorded by a clock moving with the body and equation (57.5) 
shows that this remains finite for values of r through the value 2m to zero. 

It is now evident that the reference frame we have been using is unacceptable if 
motions across the Schwarzschild sphere are to be studied and that, in particular. 
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the coordinate time r becomes infinite at r = 2m for some events which can occur 
in the experience of certain observers. It appears, therefore, that it is a deficiency in 
the reference frame which is responsible for the anomaly in the metric and our 
expectation is that the infinity can be removed by transformation to a new frame. 
This view of the matter is supported by the fact that g is finite at r = 2m, indicating 
that there is no singularity of space-time in this region. 

The suggestion arising from our calculations is that t should be replaced by a 
new coordinate time u defined by a transformation equation 

u = t + J\r) (57.6) 

where /(r) becomes negatively infinite at r = 2m in such a way as to cancel the 
infinity which we have seen to arise in t for certain events taking place on r = 2m. 
Substituting 

d( = du — /'(r)dr (57.7) 

in the Schwarzschild metric, this transforms to 

ds 2 = Fdr 2 + r 2 (dfl 2 + sin 2 0d(p 2 ) + c 2 (1 - 2 m/r) (2/'drdu — du 2 ) (57.8) 

where 

F — ~—z~ — — (r - 2m) j' 2 (57.9) 

r — Zm r 

We can now remove the infinity by choosing/(r) such that 

cf = r/(r — 2m) (57.10) 

Thus, we take 

c/(r) = r + 2m log (r — 2m) (57.11) 

and the metric then assumes the form 

ds 2 = r 2 (d0 2 + sin 2 0d0 2 )+ 2cdrdw — c 2 (l -2m/r)du 2 (57.12) 

This metric must clearly satisfy Einstein’s equation in vacuo. However, the field 
in the new frame is no longer static in the sense assumed in section 5 1 ; the presence 
of a term involving the first power of du shows that the field is not symmetric with 
respect to the past and future, i.e. the sense of description of its trajectory by a 
freely falling particle cannot be reversed with impunity. 

Let us study, once again, a body moving radially, but not necessarily falling 
freely. Along its world-line, we have 

ds 2 = 2cdrdu — c 2 ( 1 —2m/r)du 2 (57.13) 

Since ds 2 must be negative for any possible motion, 


— < jc (1 2m/r) 

du 


(57.14) 
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If r < 2m, this implies that dr/du is negative and that the body must move towards 
O; in particular, it cannot remain stationary. Thus, this is a region of irresistible 
collapse towards the centre of attraction for all physical bodies. It will be noted 
that the transformation has eliminated the possibility of outwards radial motion 
which existed when the Schwarzschild form of the metric was taken; this 
possibility can be recovered by changing the sign of u. 

Now consider the motion of a body falling freely along a radius from an initial 
state of rest dr/du = 0 at r = R > 2m. Since y 4l = c,y 44 = -c 2 ( 1 -2 m/r), 
equation (53.5) must be replaced by 


I /dr 

r(d-r - £(1 


d u » 

' 2m/r) d^ 1 = 0 


Together with the first integral 


. dr d u 

2c ss 


■ c 2 (1 — 2m/r) 


this leads to the following quadratic for dr/du; 


(57.15) 


(57.16) 



(57.17) 


(57.18) 


If r > 2m, one root is positive and one is negative. However, r must decrease 
initially (otherwise the square root in (57.18) becomes imaginary) and so the 
negative root is taken, r then decreases steadily to r = 0, its passage through the 
Schwarzschild radius being unremarkable. Once inside the Schwarzschild sphere, 
as already proved, the possibility of escape from the attraction no longer exists. 

The world-lines of photons moving radially are null geodesics governed by the 
equation 

2cdrdu - c 2 ( 1 - 2m/r)d« 2 = 0 (57.19) 

There are two families of such geodesics, viz. 



and 


d u 2 r 

dr r — 2m 


For the first family, equation (57.7) gives 


dt _ r 
dr r - 2m 


(57.20) 


(57.21) 


provided r > 2m. This corresponds to a photon moving towards the centre of 
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attraction. For the second family, we find 

d£ _ r 
C dr r-2m 


(57.22) 


in the same region; i.e. a photon moving away from the centre of attraction. A 
photon belonging to the first family crosses the Schwarzschild sphere and then 
falls into O. Inside this sphere, the photons can be separated into two classes; 
(i) those for which u is constant along a world-line- these photons could have 
their source outside the sphere; (ii) those f or which the second of equations (57.20) 
is valid and, hence, 

cu = 2r + 4m log (2m — r) + constant (57.23) 

As r -» 2m, u -* — x and these photons cannot have had an external source. Since 
du/dr < 0, these photons also fall into O. 

It is now clear that, in the field described by the metric (57.12), no photon or 
particle can cross the Schwarzschild sphere in the sense r increasing. On the other 
hand, any photon or particle which crosses the sphere in the reverse sense is 
absorbed and cannot return to the external world. The conditions inside the 
sphere are accordingly referred to as a black hole. It is thought possible by 
astrophysicists that some stars may have collapsed under their own gravitational 
attraction to a radius less than their Schwarzschild radius. In such a case, as 
explained above, further contraction would become irresistible and the star 
would collapse to a singular point having infinite density. Such a collapse would 
require an infinite time by terrestrial clocks so that, assuming the age of the 
cosmos to be finite, it might be objected that no such objects can yet have come 
into existence. However, the idea of a cosmos of present events, all happening 
simultaneously relative to some universal time scale, is quite foreign to relativity 
theory, so that the objection is meaningless. The hard fact is that the possibility of 
a spaceship falling into the black hole created by such an object, in a time which is 
finite measured by an on-board clock, is a real one. A few cases of objects which 
appear to be in the early stages of gravitational collapse have already been 
delected. 

If the metric (57.12) is transformed by changing the sign of u, another metric 
satisfying Einstein’s equation is generated. A similar analysis shows that this 
governs the field in the vicinity of a white hole , where matter and photons can only 
cross the Schwarzschild sphere in an outgoing sense. Thus, a white hole behaves 
as an irresistible source and a black hole as an irresistible sink. Being invariant 
under a sign reversal of t, the Schwarzschild metric permits a black and a white 
hole to exist together. 


58. Gravitational waves 

Throughout this section it is assumed that the gravitational field is weak and that 
the coordinates x' are quasi-Minkowskian, as explained in section 48. Thus, the 



160 


metric tensor is given by equation (48.3) and terms of second or higher degree in 
the or their derivatives will be neglected. We shall further suppose that the 
coordinate frame is harmonic (see Exercises 5, No. 50), so that the metric tensor 
satisfies the condition 

9^ = 0 (58.1) 

It can be proved that a transformation of the form x‘ = x‘ + ?'(*), where the 
functions are small with the /i^, can always be made so that the x-frame is 
harmonic (see Exercises 6, No. 37); this means that the harmonic coordinates will 
also be quasi-Minkowskian. 

To the first order, equation (58.1) reduces to 


[ii, k] = h ik i - \h,i k = 0 (58.2) 

Differentiation leads to 

h, k . ,j - lh,,. * = 0 (58.3) 

Exchanging indices j, k and adding the new equation to (58.3), we get 

+ *«./* = 0 ( 58 - 4 > 

The Ricci tensor has already been calculated to the first order of approximation 
at equation (49.2). Using the last result, this gives 

R* = (58.5) 

Also 

R = = « (58.6) 

Thus, Einstein’s tensor is given by 

Rj k~?9jkR = 2^jk. ii ~ i^jk^rr. it ~ ii (58.7) 

where h' Jk = h, k -^d lk h„ (58.8) 

Einstein’s equation of gravitation is now expressible in the form 

C 2 h) k = h ',k. a = ~ 2l<T ,k (58.9) 

The harmonic condition (58.2) can also be written 

^.i = 0 (58.10) 

In empty space, equation (58.9) reduces to 


□ 2 It. C l l L 

h j k — | -- — j + , t 4 - --—5 — 7 — 2 
dx 2 dy 2 dz 2 c 2 ct 2 


h )k 


(58.11) 


which is the wave equation, showing that gravitational waves are propagated in 
vacuo with the velocity of light. 
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In the case of a plane wave, we can write 

h' jk = exp (iTciX, ) (58.12) 

where it is understood that the real part of the complex exponential is to be taken. 
Equation (58.11) is satisfied provided 

M, = 0 (58.13) 

and the condition (58.10) also requires that 

M;* = 0 (58.14) 

Since A ik is symmetric, the last equation shows that the A i4 can be expressed in 
terms of the A# (»,/) = 1,2, 3). By further transformation of coordinates, it can 
be shown that all amplitudes can be expressed in terms of two parameters only 
and hence that gravitational waves have, essentially, only two modes of 
polarization. 

The solution of the wave equation (58.9) with source term —2 xT jk is well 
known to be given by Kirchhoff’s formula (Bateman, 1952): 


h jk (X 0 , t 0 ) 


K 

2 n 


l 


TfilxJo-r/i^dV 


(58.15) 


where x 0 = (x l 0 , Xq, Xq), x = (x l , x 2 , x 3 )are position vectors with respect to the 
origin Oof the frame in use,andr = |x 0 - x| is the distance between these points; 
V is the region of space over which T jk is non-vanishing. Note that t 0 is retarded in 
the integrand by a time r/c, since the effect of the source at x will not be felt at x 0 
until the time for its transmission over the distance r has elapsed. In the case of a 
source which is confined to a small region of space including the origin O, if 
r 0 = |x 0 | is large compared with the dimensions of this region, equation (58.15) 
can be approximated by 


h'jk.(Xo- ( o) = 2 “ j* T jk (x,t 0 - r 0 /c)dV (58.16) 


But, as explained above, the components (t' 4 can be obtained easily once the h 
have been found. We shall now show that a further simplification of the last 
formula is possible in these cases. 

First note that 


(7',.,x # ) .. = T rtr .x* + 7,0 (58.17) 

Since the divergence of T tj vanishes, we have 

T„, + T„.< = 0 (58.18) 

and thus 

(T ri x% = T* f -T^ A xf> (58.19) 
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Integrating over the region V, the integral of the left-hand member is seen to be 
zero by application of Gauss's divergence theorem (assuming T,. vanishes over 
the bounding surface); we accordingly obtain the result 


r 

r* . 

n t d 


T a 4 A x fi 6V= ---- 
cat % 


Jr^xME (58.20) 

Exchanging the indices a, /( and adding the new identity to (58.20), we find 


= (58.21) 

We next integrate the identity 

(7' y4 x a x < ') .. = 7V, .j.x’x*' + T, i x / +T Bi x* (58.22) 


over V. By the divergence theorem, the integral of the left-hand member vanishes. 
Hence 


(7; 4 x* + 7* 4 xW = 



y x 3 x"d V 


Equations (58.21) and (58.23) now yield the result 


(58.23) 


j r "-' xV iv - -hh\ T ~* x ‘* ,iV 

- -jp&\ T “ x ' x ' iV 158241 

where we have again made use of the equation T (l J = 0. But, equation (21.14) 
shows that r 44 = - nc 2 and equations (58.16) and (58.24) therefore lead to the 
final result 

G r| ~ /* 

*^(x o .0= - 4 - Ti q(x, f -r 0 /c)x 7 x p dV (58.25) 

( 1 0 Ol J 

It should be noted that it is the second time derivative of the second moment of 
the mass distribution which is responsible for the gravitational wave. In the 
corresponding electromagnetic situation, it is the second time derivative of the 
first moment of the charge distribution which is responsible for the elec- 
tromagnetic wave. 

Instruments have been devised to detect the small variations in the gravi- 
tational field caused by waves proceeding from possible sources within the galaxy 
(e.g. pulsating neutron stars, binary stars or supernova explosions), but no clearly 
unambiguous results have yet been obtained. Such instruments attempt to 
measure the small strains induced in very large masses of metal by the tidal forces 
caused by the passage through them of gravitational waves. 


T ^ dy= hdt 
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Exercises 6 

1. If the v-frame is defined as in section 47, show that the metric tensor in the x- 
frame is given by 


Hence lower the index j in defined by equation (47.2) and show that the result is 
Tj as defined in equation (47.3). 

2. Given that space- time has the metric 

ds 2 = dx 2 +dy 2 + e 2 "dz 2 -e 2<, dt 2 

where 0 , cf> are functions of z only, prove that the Riemann-Christoffel tensor 
vanishes if, and only if, 

<t>" —t)'(j)‘ + 0' 2 = 0 

where dashes denote differentiations with respect to z. If (p = — 0, prove that the 
space -time is flat provided $ = j log (a + bz), where a , b are constants. 

3. If space time has the metric 

ds 2 = e^dr 2 + dr 2 ) + r 2 e~ p d</> 2 -e p dr 2 

where /., /» are functions of r and z only, show that the field equations in empty 
space K it = 0 require that /. and p should satisfy the equations 

'i +Pi = ?r(p\ -pi) 

'■2 + Pi = rpip 2 
Pi 1 + P22 + rPl =0 

/. I ! + /-22 + P l 1 + P22+i(Pt+P2) = 0 

where subscripts I and 2 denote partial differentiations with respect to r and z 
respectively. 

4. If space time has the metric 

ds 2 = e^ldx 2 + dy 2 + dz 2 - dr 2 ) 

where k is constant, and t 2 = x 2 + y 2 + z 2 , dots denoting differentiations with 
respect to r, show that for a freely falling body 

I — i 2 = (1 — V 2 )e 2kx 

where 1 = V at x = 0. 

5. If space- time has the metric 

ds 2 = a 2 (dx 2 +d>’ 2 +dz 2 ) — c 2 adr 2 

where a = 1/(1- fcx)and k is constant, and 1 is as defined in the previous exercise. 



164 


prove that for a freely falling body 

V 2 - i 2 = kc 2 x 


where r = V at x = 0. 

6. The space time metric over a certain region of empty space is 

ds 2 = e*(dx 2 -fdy 2 + dz 2 ) — e^d/ 2 

where a. )S are functions of z alone. Show that Einstein’s equation is satisfied 
provided 

y“ + W 2 + \x'[S' = 0 
a* + 4/T + lli 2 -la I? =0 
P" + \P 2 + \*P‘ = o 

Deduce that t* = A{k - r) 4 , = B(k — z) 2 . where A, B, k are constants. 

7. Show that the space- time metric 

d.v 2 = e*dr 2 + r 2 dd 2 + e^dz 2 — e'dt 2 

where r. 0. z are quasi-cylindrical polar coordinates and t is the time and a. [i yare 
functions of r alone, satisfies Einstein's equation in vacuo, provided 

P" +7" +{P‘ 2 + 3 '/' 2 - fa' - \y’P’ - ya'y' = 0 
a' = IS' + )■' 

F + W 2 -\*'fr+tf?+Ut' = o 

7" + Vi' 2 - iay + \P'7' + 7 i = o 

dashes indicating differentiations with respect to r. Deduce that 

e 2 = Ar ' 2 *"', e“ = Br f e> = Cr u 

where A. B. C. p are constants and 7.^i = 2(/. + n). 

8. A certain region of space- time has metric 

d.s 2 = dx 2 + dy 2 + dz 2 - x 2 d( 2 

A particle is stationary at the point x — 1, y = z = 0 at / = 0. If the particle is 
released at this instant and falls freely, show that it moves along the x-axis with 
equation of motion x = sechr. A photon is emitted from the point ( 1, 0, 0) at t = 0 
in the direction of the positive y-axis. Show that at this instant x = i = 0, y = 1 
and that the path of the photon is the circle x 2 + y 2 = 1. 

9. De Sitter’s universe has metric 

ds 2 = A " ‘dr 2 + r 2 (d0 2 +sin 2 0d0 2 ) - Ac 2 dt 2 

where A = 1 - r 2 ;R \ R being constant. At r = 0, a photon leaves the origin r = 0 
and travels outwards along the straight line 0 = constant, <p = constant. Find its 
coordinate r at time f and show that r = %R when t = R (log 3)/ 2c and that r R 
as t -» oc. 



10. r, 0. : are quasi-cylindrical coordinates in a gravitational field determined 
by the metric 

ds 2 = r 2 (dr 2 + d0 2 ) + r( dz 2 - d( 2 ) 

A particle is projected from the point r = 1. 0 = 0, z = 0 in the held with such 
velocity that r = z = 0, 0 = 3/2 (dots denote differentiations with respect to f). 

Prove that, if the particle falls freely, it moves in the plane z = 0 between the 
circles r = 1, r = 3. first touching the outer circle where 0 = y/in. A photon is 
emitted from the point r = 1,0 = 0, z = 0 and moves initially so that r = z — 0. 
Prove that its path is the spiral r = 1 + £0 2 in the plane z = 0. 

11. The metric for de Sitter’s universe can be expressed in the form 


d.s 2 = e 2 “ *(dx 2 + dy 2 + dz 2 ) - c 2 dr 2 

where R is a constant and x, y, z can be treated as rectangular Cartesian 
coordinates. Show that the trajectories of freely falling particles and photons are 
straight lines. A particle is projected from the origin at t = 0 with a velocity V 
along the positive x-axis. Prove that its x-coordinate at time t is given by 


Vx = R [< - v ' (c 2 - V 1 + V 2 e - *)] 

A body at the point x = X on the x-axis emits a photon towards the origin at 

ft 

I = 0. Show that the photon arrives at O at time t = - log(l - X/R). Discuss 
the case where X > R. 

12. r, 0 , 0 are quasi-spherical polar coordinates in a gravitational field which is 
spherically symmetric about a centre of attraction r = 0. The space-time metric is 


d.s 2 =1^—^—^ dr 2 + r 2 d0 2 + r 2 sin 2 0d0 2 — df 2 

\r + l J r + 2 

A particle is projected from the point r = 1, 0 = jtt, 0 = 0 at t = 0 with velocity 
such that r = 0, 6 = 0, 0 = 1/^/6 and falls freely. Show that the particle’s 
trajectory lies in the plane 0 = {n and has polar equation 


5 — cos(«0) 
3 + cos(«0) 


where a = (8/3). Deduce that the particle moves between two circles of radii 1 

and 3 and calculate the increment in 0 between two successive contacts with one 
of these circles. (Ans, 2 n/a.) 

13. Taking the metric for de Sitter’s universe in the form stated in exercise 9, 
find equations of motion for a particle projected from the point r = \R. 


0 = 2 rr, 0 = 0 with such velocity that f = 0 = 0, $ = 


and thereafter falls 
2 R 


under gravity. Show that its trajectory lies in the plane 0 = \n and that its polar 



166 


equation is 

r = fl.(5cos 2 0 - l) 1 - 2 . 

14. Oxyz is a quasi-rectangular Cartesian coordinate frame constructed in a 
certain gravitational field. If t is the time measured by a system of clocks 
stationary in the frame, the space-time metric is ds 2 = z(dx 2 + dy 2 + dz 2 — dr 2 ). A 
particle is projected from the point (0,0, I) at t = 0 with velocity components 
x = l?( < 1), y = z — 0 and thereafter falls freely. Show that its trajectory is a 
parabola and lies in the xz-plane and that the particle arrives at the xy-plane at 
time t = 2 < n /(1 - e 2 ). 

15. Show that Einstein’s equation (47.16) can be written in the form 

R'’ + Kg ' 1 = K[{lg' J — I '•') 

16. Inside a static gravitating homogeneous sphere of liquid, the proper 
density is p. (a constant) and the pressure is p. The energy-momentum tensor has 
zero components except for T [ = T\ = T] = p. 7 J = - c 2 p. Assuming that the 
metric of the field inside the sphere is given by equation (51.10) with a = e 2 
and b = e^, show that Einstein's equation (47. 1 5) can be satisfied by making x. [i 
and p satisfy the equations 

y-{r(l - e J )J = xc 2 ///- 2 

dr 

/f = -(e 1 - 1) + Kre*p 

r 

P" + 2/f 2 - yx'/f - - x = kt jl (p - c 2 //) 

r 

dashes denoting differentiations with respect to r. Assuming x = 0 at r = 0 and 
p = 0 at r = a (the surface), deduce that 

= 1 -tyr 2 

where q = Kc 2 p/3. and that 

, (1 -qr 1 ) 1 2 -<l -qa 2 ) 1 2 

P ‘ "3(1 -qa 2 )' 2 - (f-zyr 2 ) 1 2 

17. Obtain the equations of motion of photons moving radially inside the 
Schwarzschiid sphere and deduce that a photon moving away from the centre O 
takes an infinite coordinate time f to reach the sphere and a photon moving 
towards the centre from r = R( < 2m) takes a time r = 7 given by 

c7 = - R - 2mlog( 1 - R 2m) 

to reach O. 

18. Obtain equations (57.2) and (57.3) for a body falling freely towards the 
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centre of attraction in the region r > 2m and hence prove that 
R 


ci - 


2m 


- I 


{ r ( /? - r)} + (R + 4m)cos l (r/R) 


1/2 


- 2m log 


where 



2 m(R - r) 1/2 

7 = Lr(« _r 2mj_ ' 

Deduce that t -» oc as r -» 2m. 

19. Obtain equations (57.4) and (57.5) and deduce that the time recorded on a 
standard clock attached to a freely falling body as it falls from the Schwarzschild 
sphere to the centre of attraction is nm/c. Calculate this time in the case of a black 
hole having solar mass. (Ans. 16 /js.) 

20. Verify that the metric tensor given by equation (57.12) satisfies Einstein’s 
equation in empty space. 

21. Show that the Kruskal-Szekeres transformation 


u = (r/2m — l) 1 ' V ,4,n cosh(cr/4m) 
i = (r/2m — l) 1( V' 4m sinh(ct/4m) 

converts the Schwarzschild metric to the form 

ds 2 = :2m (du 2 — du 2 ) + r 2 (d0 2 + sin 2 0d$ 2 ) 

r 

where r is given in terms of u, v by the equation 

u 2 -v 2 = (r/2m — l)e rl2m 

Deduce that the world-lines of radially moving photons are u + v = constant. 

22. Show that the transformation 


u = i- + - r 
3 a 


3/2 


= l + 2ar 112 -a 2 \og-yjY- 


v =- 


where a 2 = 2m, puts the Schwarzschild metric into the form 


ds 2 = n 2 (u - c) 2,3 du 2 + fj 2 (u - c) 4/3 (d 6 2 4- sin 2 0d$ 2 ) - dr 2 
where ^ = 9« 2 /4. 

23. A photon is emitted from the point r = m, 6 = <p = 0, inside a black 
hole (Schwarzschild coordinates) with angular velocities () = 0, = 3^3 c/m. 

Show that f = + 2y/lc initially. In the case when the initial value of f is negative, 
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show that the photon moves in the plane 6 = ^rcand falls into the centre along the 
trajectory 

— = 3coth 2 ^(a -<p)~ 1 
r 


where a = log^(5 + ^21). 

24. Show that the only possible circular orbits for a photon in a Schwarzschild 
field all have radius r - 3m and that their period in coordinate time is 
Show that these orbits are unstable. 

25 A body moves in a circular orbit of radius r in the plane fl = ^ in a 
Schwarzschild field. Show that r > 3m and that the angular velocity d0/dt is 
related to r in the same way as in classical theory. Show that the period of the 
motion as measured by a standard clock attached to the body is 


2 nr 
c 



Show, also, that the period as measured by an observer using a standard clock 
who is stationary at some point on the orbit is 



Show that the orbit is unstable if 3m < r < 6m, but is stable otherwise. 

26. r, 0, 0 are Schwarzschild coordinates. A fixed observer at the point ft , 0, 0 
transmits a wireless signal radially towards the attracting body. The signal is 
reflected by a small body at the point r, 6, 0 and returns to the observer. Show that 
the lime elapsing between transmission and reception as measured by the 
observer’s standard clock is 


2 

-(1 -2 m/ft ) 1 ' 2 


| R -r + 2mlog 


ft - 2m \ 
r - 2m / 


Calculate the distance covered by the signal and deduce that, according to 
classical theory, the time for the double journey would be 


(ft 2 -2mft)’- 2 -(r 2 -2mr) 1/2 +2mlog 


ft 1 2 + (ft -2m) 1 
r 1 ' 2 + (r -2m)‘ 


Show that, to the first order in m, the difference between these times is 


2m 

c 


. ft r 
k,g 7+ --> 


{Note'. This result suggests a method of checking the general theory using the 
Sun’s field and Mercury or Venus as the reflector.) 
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27. An atom, which is stationary at a Schwarzschild coordinate distance r from 
the centre of a spherically symmetric body, emits light of frequency v which is 
observed by a stationary observer at a coordinate distance R ( > r) from the 
centre. Show that the observed frequency is v-<5v, where 

_i) 

to the first order in m. 

28. By replacing the spherical polar coordinate r occurring in the 
Schwarzschild metric (52.10) by a new coordinate r' where 



obtain this metric in ‘isotropic’ form, viz. 

ds 2 = ^l + ™ ; ^ (dr' 2 +r' 2 dfl 2 +r' 2 sin 2 0d$ 2 )-^| r 2 dt 2 

29. Employing a certain frame, an event is specified by spatial coordinates 
(x, y, z) and a time t. The corresponding space-time manifold has metric 

ds 2 = dx 2 +dy 2 + dz 2 + 2atdxdr - (c 2 —a 2 i 2 )dt 2 

Show that a particle falling freely in the gravitational field observed in the frame 
has equations of motion 

x = A + Bt - {at 2 , y = C + Dt, z = E + Ft, 

where A, B , C, D, E, F are constants. By transforming to coordinates (x\ y, z, r), 
where x' = x + jc/f 2 , and recalculating the metric, explain this result. 

30. (x 1 , x 2 , x 3 )are spatial coordinates of an event relative to a frame Sand x 4 is 
the time of the event measured by a clock in S. A second frame / is falling freely in 
the neighbourhood of the event and may be regarded as inertial. Oy‘>' 2 >’ 3 are 
rectangular Cartesian axes in / and y*/ic represents the time within / as measured 
by synchronized clocks attached to the frame. Show that the metric tensor in S, 
is given by 

dy k dy k 

9,J = d^Ix 1 ' 

P is a point, fixed in S, having coordinates (x l , x 2 , x 3 ). At the instant x 4 , / is 
chosen so that P is instantaneously at rest in /. Deduce that 

g>’ 4 ^ 9a 
dx 1 7(044) 

at x‘ ,J1 is the distance between P and a neighbouring point 
P’(x‘ +dx\ x 2 +dx 2 , x 3 +dx 3 ) 
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as measured by a standard rod in / at the instant x 4 . Prove that 
d I 2 = dy'dy’ = y^dx'dx" 
where a, g range over the values I, 2, 3, and 


try 0/i/i 

044 

(■/,„ is the metric tensor for the 3 which is S at the instant x 4 .) 

31. Oxy-c is a rectangular Cartesian inertial frame I. A rigid disc rotates in the 
xv-plane about its centre O with angular velocity tu. Polar coordinates (r, 9) in a 
frame R rotating with the disc are defined by the equations 

x = rcos(0 + cut), >■ = r sin (0 + air) 

where t is the time measured by synchronized clocks in the inertial frame. If the 
time of an event in R is taken to be the time shown by an adjacent clock in /, show 
that the space time metric associated with R is 

d.s -2 = d r l + r 2 d0 2 4- 2ior 2 d9dt - (t 2 -r 2 0 ) 2 ) di 2 


Deduce that the metric for geometry in R is given by 


d I 2 = dr 2 + 


r 2 d0 2 
- u> 2 r 2 /c 2 


(Him: employ the result of the previous exercise.) Hence show that the family of 
geodesics on the disc is determined by the equation 


0 = const, - sin 



a 

IT 


y/(r 2 -a 2 ) 


where r t = c/a) and |a| < r,. Sketch this family. What is the physical significance 
of r ! ? 

32. x'(i = 1, 2, 3, 4) are three space coordinates and time relative to a reference 
frame S. A test particle is momentarily at rest in S at the point (x\ x 2 , x 3 ) at the 
time x 4 . If g u is the metric tensor for the gravitational field in S, write down the 
conditions that the world-line of the particle is a geodesic and deduce that 

d 2 * 4 Ucg tt gu CQtA eg h 

y,4 (dx 4 ) 2 2\ ex' 044 £x 4 / cx 4 


at the point x‘. Hence show that the covariant components of the particle’s 
acceleration in S are given by 


d 2 x" 

(d7 


/ 30 


= -~-(c 2 +2U)' 


, 


where y xS is defined in exercise 30 and 


044 = - (C 2 + 21/), ’/. = 0 a4 / V ( “ 04«) 
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(U, y„ are the gravitational scalar and vector potentials respectively.) 

Show that, in the case of the space time metric appropriate to the rotating 
frame of exercise 31, the gravitational vector potential vanishes and the scalar 
potential is given by U = jw 2 r 2 . Interpret this result in terms of the centrifugal 
force. 

33. De Sitter's universe has metric 

d.s 2 = A ‘dr 2 + r 2 d() 2 + r 2 sin 2 Wd$ 2 - Ac 2 dt 2 


where A = 1 -r 2 /R 2 , R being constant. Obtain the differential equations satis- 
fied by the null geodesics and show that along null geodesics in the plane 0 = - 2 n, 

a~ = r(r 2 - a 2 )' 2 
d <p 

where a is a constant. Deduce that, if r, $ are taken to be polar coordinates in this 
plane, the paths of light rays in this universe are straight lines. 

34. Einstein’s universe has the metric 

d.v 2 = .- 5 dr 2 + r 2 d0 2 + r 2 sin 2 Od0 2 -c 2 dt 2 

1 - /.r 


where (r, 0, $) are spherical polar coordinates. Obtain the equations governing 
the null geodesics and show that, in the plane 0 = {n, these curves satisfy the 
equation 

dr V 

d j-J = r 2 (l -;.r 2 )(/rr 2 - 1) 

where n is a constant. Putting r 2 = l/r, integrate this equation and hence deduce 
that the paths of light rays in the plane 0 = \n are the ellipses 


/.x 2 + ny 2 


1 


where (x, y)are rectangular Cartesian coordinates. Show, also, that the time taken 
by a photon to make one complete circuit of an ellipse is 2 n/(c/.' 2 ). 

35. If the metric of space time is 

d.s 2 = a 2 (dx 2 +dy 2 + dr 2 ) -Aotdf 2 

where a is a function of x alone and k is a constant, obtain the differential 
equations governing the world-lines of freely falling particles. If x, y, z are 
interpreted as rectangular Cartesian coordinates by an observer and r is his time 
variable, show that there is an energy equation for the particles in the form 


ft 2 -— = constant 
2 2x 

36. ( r,0,<p,i ) are interpreted as spherical polar coordinates and time. A 
gravitational field is caused by a point electric charge at the pole. Assuming that 
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the space-time metric is given by equation (51.10) and that the 4-vector potential 
for the electromagnetic field of the charge is given by O, = ( 0 , 0 , 0 ,/). where 
/ = /(r), calculate the covariant components of the field tensor F tJ from equation 
(26.6) and deduce the contravariant components F‘‘. Assuming that J‘ = 0, prove 
that Maxwell’s equations (56.4) and (56.5) are all satisfied if 


d/ 

dr 


An e 0 r 2 


> /M > ) 


where q is a constant. 

Calculate the elements of the mixed energy-momentum tensor from equation 
(56.6) and write down Einstein’s equations (47.15) for the gravitational field. 
Show that these are satisfied provided 

1 2m q 2 G 1 

-=6=1 +— - — T 

a r 47t£ 0 c r l 

where m is a constant. 

37. If the coordinates x' are quasi-Minkowskian so that the metric tensor is 
given by equation (48.3), show that the transformation x' = x' + 4'(x) makes the 
x-frame harmonic provided the 1‘ satisfy the conditions 


Z'.jj = h.j.j-hhj,., 


(Neglect second order terms in the 6 , ; and use the condition given in Exercises 5, 
No. 50.) Show also that h ti = 0 provided the functions £' satisfy the additional 
condition = j 6 fj . If the x-frame is harmonic before transformation, show that 
the x-frame is also harmonic provided jj = 0 . 

38. A sphere of mass M is expanding in such a manner that its density remains 
uniform. If er(r) is its radius at time (.show that, at a large distance r from its centre, 
the gravitational wave generated has components 

4 GM , 

h, I = «22 = «33 = (“ + «“) 

5 re 

the components 612 , 623,631 being «ro. (The bracketed expression is to be 
calculated at the appropriate retarded time.) 

39. A uniform rod of mass M and length 2 a is pivoted with its centre at the 
origin of the x-frame and rotates in the x 2 x 3 -plane with angular velocity w. Show 
that, at a large distance r from the rod, the gravitational wave generated has non- 
zero components 

6' 22 = -633 = /lcos 2 a»(, 623 = /lsin 2 o>r 


where A = 4GMa 2 a> 2 /7>rc* and the instant 1 = 0 has been chosen appropriately. 

40. Show that, if the cosmical constant term is retained in Einstein's equation, 
it reduces in empty space to R lt + A g tj = 0. Deduce that the spherically symmetric 
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Schwarzschild solution (cf. equation (52.9) ) is given by 


b = 


2 m 
r 



Using the approximate equation (48. 17), show that this implies the existence of an 
additional force of repulsion from the centre proportional to the radius r. 



CHAPTER 7 


Cosmology 


59. Cosmological principle. Cosmical time 

Cosmology is the study of the large-scale features of the universe, such as the 
distribution and motions of the galaxies and the density of radiation and dust 
through intergalactic space. It is also concerned with the manner in which these 
features can be expected to change over very long periods of time measured in 
billions (10 9 ) of years, i.e. with the evolution of the cosmos. Such calculations also 
throw light on the stages through which the cosmos has passed to arrive at its 
present state, and attempt to answer the question, did the universe have a 
beginning in time or has it always existed? If the universe had a beginning, as the 
evidence now strongly suggests, the study of its state during the very early stages 
of its evolution is called cosmogony. 

Since the galaxies are electrically uncharged, the only force influencing their 
motion is gravity. Thus, cosmology is necessarily founded on a theory of 
gravitation. It has been shown (see, e.g., Bondi, 1960) that the Newtonian theory is 
quite capable of generating models for the cosmos which provide explanation 1 
for many of its observed features. However, these models necessarily assume that 
space is Euclidean, whereas Einstein's theory indicates that, in the presence of a 
gravitational field, space becomes curved and its geometry is then Riemannian. 
The curvature generated by the gravitational attraction of a galaxy is inap- 
preciable and may be disregarded so long as we confine our attention to regions of 
space whose dimensions are comparable with those of a galaxy, but this effect has 
major consequences when the spatial extension of the whole cosmos is 
considered; in particular, as we shall see, a possible consequence is that the total 
volume of space is finite and, therefore, that the universe is not potentially infinite 
in extent as a Newtonian cosmology must assume. Only cosmic models which are 
in accord with general relativity theory will accordingly be studied. 

The reader will be presumed familiar with the basic facts relating to the 
distribution of matter and radiation over the cosmos as it is observed in the 
present epoch (Rowan-Robinson, 1979). The mass of the radiation is roughly 
one-thousandth of the mass of the galactic matter and its gravitational effect is 
therefore negligible by comparison with the attraction of the galaxies. However, 
at earlier epochs, the contribution to the gravitational field of the radiation was 
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probably much more considerable and, during the first million years after the ‘big 
bang’, it is thought that the cosmos was dominated by its radiation; this radiation 
is assumed to have been in equilibrium with the matter and hence to have 
acquired a black-body frequency distribution. The remnant of this black-body 
radiation in the present epoch was detected by Penzias and Wilson in 1965 and 
this still forms the major part of the total cosmic radiation. It is not known what 
proportion of the matter in the universe has been attracted into the galaxies; the 
density of matter in intergalactic space is certainly so small as to have no 
observable effect on the light transmitted through these regions from the most 
distant sources, but the volume of space is so large that this observation is not 
inconsistent with the hypothesis that the net mass of intergalactic matter is many 
times that of the matter present in the galaxies. As will be seen, our ignorance in 
regard to this datum prevents our reaching a firm conclusion whether the cosmos 
is finite or infinite in extent. Although the galaxies often occur in clusters, from 
our viewpoint their overall distribution appears to be isotropic and homoge- 
neous. At very great distances, the galactic density is observed to increase, but it 
must be remembered that such observations are carried out by light which was 
emitted at a much earlier epoch when all the galaxies are thought to have been 
closer together; it is assumed that, at the ‘present cosmical time’ ( precise definition 
follows later), the density of galactic mass is uniform throughout the cosmos. 

That the galactic density is decreasing as the universe evolves is in accordance 
with the observed recession of the galaxies. To be more precise, what is observed is 
that the spectrum of the light from a distant galaxy is shifted towards the red end 
of the spectrum by an amount which is approximately proportional to its 
distance. This is Hubble's law. The reduction in frequency is interpreted as a 
Doppler effect caused by the motion of the galaxy away from the observer along 
the line of sight. Since it is supposed that the whole universe is in a state of 
expansion, each galactic observer will experience a recession of all the other 
galaxies in accordance with Hubble's law. Clearly, if matter is conserved during 
this expansion, a steady reduction in its density is inevitable and there is now an 
accumulation of evidence that the matter density was indeed greater in the distant 
past than it is today. However, a steady-state cosmology (Bondi and Gold. 1948, 
Hoyle, 1948) has been proposed in which the galactic density remains constant 
due to the continuous creation of matter in intergalactic space; this matter 
condenses into new galaxies and so maintains a steady-state distribution. 

As a first step towards the construction of a mathematical model of the cosmos, 
we shall treat the galaxies as point masses or molecules forming a galactic gas and 
further assume that this gas behaves like the perfect fluid studied in section 22. In 
particular, its energy -momentum tensor will be supposed given by equation 
(22.21). At the present epoch, this gas is exceedingly rarefied and, since the random 
motions of the galaxies relative to the background black-body radiation (which 
provides a natural frame of reference) are of relatively small magnitude, the 
pressure associated with the gas is very low; thus, at this and later epochs, it will be 
permissible to neglect the pressure and to treat the gas as an incoherent dust 
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cloud. However, during the early phases of cosmic evolution, the temperature and 
pressure are believed to have been very high indeed and the pressure terms cannot 
then be neglected; further, during these phases the contribution of the 
background radiation becomes significant and terms representing this contri- 
bution must also be included in the energy-momentum tensor. 

We next assume that there are no privileged galactic observers, i.e. all observers 
moving with the galactic gas will be assumed to see the same large-scale process of 
evolution of the cosmos. This is the cosmological principle. The steady-state 
theory is based upon an extension called the perfect cosmological principle-, this 
asserts that all galactic observers see the same large-scale state of the cosmos at all 
times. Observation supports the first principle but not the second. 

If the cosmological principle is accepted and the perfect principle rejected, it is 
possible to define an absolute cosmical time , i.e. a way of assigning times to cosmic 
events which is independent of the observer. For all galactic observers will 
experience the same process of cosmic evolution and the various characteristic 
stages of this process can be allocated times according to some agreed scale. It is 
not necessary at this point in the argument to tie the scale to time measured by 
standard clocks; we only require that the later stages of an observer’s experience 
be allocated times which are greater than the times allocated to earlier stages. 
Then, the cosmical time of any event can be defined unambiguously as the time 
recorded for the event by an adjacent galactic observer using the agreed time 
scale. Thus, the state of the cosmos at any epoch is now defined to be the set of 
events whose cosmical times are all equal to the cosmical time of the epoch. Since, 
at a given epoch, all galactic observers will be experiencing similar processes, the 
large-scale state of the cosmos at a given epoch must be homogeneous and 
isotropic for each such observer. 


60. Spaces of constant curvature 

At a given epoch, as we have just seen, the state of our cosmological model must 
be homogeneous and isotropic. In particular, the three-dimensional space in 
which the model is constructed must have these properties. The surface of a 
sphere is a two-dimensional space of this type embedded in <f 3 and it is obvious 
that a three-dimensional hypersphere embedded in c? 4 will have all the 
characteristics we need for our purpose. Also, just as the ordinary sphere includes 
the <? 2 plane as a special case when its radius becomes infinite, a hypersphere of 
infinite radius will correspond to <f 3 , which is an especially simple case of a 
homogeneous and isotropic space. In addition, we shall be led quite naturally to 
consider a third class of such spaces which, like S-$, but unlike the hypersphere, 
have infinite volume. These three types of space all have constant curvature 
scalars R and are therefore called spaces of constant curvature. All these spaces 
have positive-definite metrics, as they must have if their geometry is to be 
Euclidean over sufficiently small regions. It may be proved that there are no other 
Riemannian spaces having such metrics which are homogeneous and isotropic. 
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Let (x, y, z, u) be rectangular Cartesian coordinates in <f 4 . Then, a hypersphere 
of radius S has equation 

x 2 + y 2 + z 2 + u 2 = S 2 (60.1) 

Since u is determined in terms of x, y, z by this equation, a coordinate frame for 
points on this hypersurface can be constructed by first allocating coordinates 
(x, y, z) to the point having coordinates (x, y. z, u ) in the Cartesian frame. Provided 
x, y, : are small by comparison wiih S, they will behave approximately like 
rectangular Cartesian coordinates in S } and we shall therefore define quasi- 
spherical polar coordinates (r, 6. <f>) by the usual transformation equations 

x = rsinfleos^, y = r sin 0 sin </>, z = rcosO (60.2) 

Equations (60.1) and (60.2) give 

u 2 = S 2 — r 2 (60.3) 

from which we find by differentiation that 

du 2 = — dr 2 = — T dr 2 (60.4) 

u 2 S -r 2 

Hence, the distance d.s between the points (x, y, z, u) and (x + dx, y + dy, z +dz, 
u + du) on the hypersphere is given by 

d.v 2 = dx 2 +dy 2 +dz 2 +du 2 

= dr 2 + r 2 (dO 2 + sin 2 0 d 0 2 ) + r 2 dr 2 /(S 2 - r 2 ) 

S 2 

= 5 - dr 2 + r 2 (df ) 2 + sin 2 B d$ 2 ) (60.5) 

S 1 — 

This is a metric for the hyperspherical Clearly, by taking 1/S 2 = 0, the metric 
reduces to that for <? 3 in ordinary spherical polars. 

If the curvature scalar is calculated from this metric, it will be found that it 
equals - 6/S 2 , i.e. is constant. It is now evident that if S 2 is replaced by - S 2 , the 
curvature scalar will still be constant with value 6 . S 2 and the space will remain 
homogeneous and isotropic. This is the third type of such a space; its metric is 

S 2 

ds 2 = — , j dr 2 + r 2 (d 0 2 + sin 2 0d<p 2 ) (60.6) 

S 2 + r 2 

Since these spaces are homogeneous and isotropic, the pole r = 0 can be taken 
to be any point and the axes from which 0 and <f> are measured can be taken in any 
pair of perpendicular directions. If r is small compared with S, both metrics (60.5) 
and (60.6) approximate to the spherical polar metric for r^ 3 , implying that the 
spaces are Euclidean over small regions. 

Consider the circle r = constant, 0 = jn, in the space with metric (60.5). The 
distance between neighbouring points <£, <p+d<t> on the circle is given by the 
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metric to be d.v = rd<p and the circumference of the circle is accordingly 2 nr. 
Along a radius of this circle d(l = d(p = (land the distance between points r. r ■+■ dr 
is given to be 

d.s = Sdr, v (S 2 - r 2 ) (60.7) 

Integrating from r = 0 to r, the length of the radius is found to be 

p = S sin 1 |r S) (60.8) 

Thus, /• = .S' sin (p, S') and the circumference < is given in terms of the radius p by 

c = 2 nS sin (p S) (60.9) 

Since sin (p/S) < p/S, c is smaller than 2np, which is the Euclidean result. 

The formula (60.9) receives a simple interpretation in the allied case of the 
which is the surface of an ordinary sphere of radius S. The quantities S. r , p are 
indicated in Fig. 8, from which the relationships just found are readily seen to be 
valid. Clearly, as p increases, r first increases until it achieves a maximum value S, 
and thereafter decreases until p = nS, when r becomes zero. It now appears that 
our coordinate system is ambiguous, in that two different points can have the 
same coordinates; this deficiency can be rectified by replacing r by p using the 
transformation equation (60.8), giving a new metric 

d.v 2 = dp 2 + S 2 sin 2 (p/S)(d0 2 + sin 2 ()d$ 2 ) (60.10) 

This transformation has also eliminated the singularity in the metric (60.5) at 
r = S. Like the Schwarzschild singularity, this is a property of the coordinate 
frame and evidently docs not correspond to a singularity in the space itself. 

Putting p/S = the metric (60.10) can also be expressed in the convenient 
form 

d.v 2 = S 2 [di/t 2 + sin 2 i/»(d 0 2 + sin 2 (7d0 2 ) } (60.1 1 ) 

(See Exercises 5, No. 33.) 



Fig. 8 
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In the case of the space with metric (60.6), r can assume all positive values and is 
not ambiguous. By putting r = S sinh^, this metric can be transformed to 

ds 2 = S 2 {dip 2 + sinh^fdO 2 + sin 2 0d$ 2 )} (60.12) 

Another form for these metrics which will be specially important later is 
obtained by putting r = So. The metrics (60.5) and (60.6) can then both be 
expressed by 

ds 2 = S 2 


do 2 
1 —kc 


+ <r 2 (d0 2 + sin 2 f)d$ 2 ) 


(60.13) 


where k = 1 in the case (60.5) and k = - 1 in the case (60.6). The special Euclidean 
case can also be accommodated by permitting k to be zero. The new coordinate o 
is dimensionless and, if k = 1, is restricted to values satisfying 0 cr < 1. For the 
other values of k, o takes all positive values. 

To calculate the volume of some region of an £# 3 , let y* (a = 1, 2, 3) be geodesic 
rectangular Cartesian coordinates in the neighbourhood of some point P (section 
39). Assuming that the metric is positive definite (as in the present case), the y a will 
all be real. Then, if x’ are coordinates with respect to any other frame, the metric 
tensor in the x-frame will be given by 


ex' dx f '* 8x x dx f 


(60.14) 


Hence, if the Jacobian determinant c(y l , y 2 , > ,3 )/e5(x' , x 2 , x 3 ) is squared by 
multiplying its rows by its columns, it follows that 


J’ 3 ) 

c(x l , x 2 , x 3 ) 



= g 


(60.15) 


But, the volume <5Fof a small region A in the neighbourhood of P is given by 


3V: 


dy l dy 2 dy 


= 1 


r>(x\ x 2 , x 3 ) 


dx l dx 2 dx 3 


(60.16) 


Thus, the volume enclosed by the coordinate surfaces x l , x 2 , x 3 , x l +dx‘, 
x 2 +dx 2 , x 3 +dx 3 is 


dV = ^'gdx 1 dx 2 dx 3 (60.17) 

The formula for the volume V of a finite region F of now follows, viz. 


y/g dx 1 


dx 2 dx 3 


(60.18) 


In the case of the space with metric (60.5), g = S 2 r 1 sin 2 0/(S 2 -r 2 ) and the 
whole space has volume 


f* . f " , , f 2 ” Sr 2 sin 6 , 


(60.19) 
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where the factor 2 is needed since the range 0 < r < S covers only half the sphere 
(see above). Performing the integrations, this volume is found to be 2n 2 S 3 . 

It will be found that the total volume of the space with metric (60.6) is infinite, 
as in the Euclidean case. 


61. The Robertson-Walker metric 

In this section, we shall calculate a space-time metric for the cosmos in a frame 
formed by clocks moving with the galactic gas, all reading cosmical time x 4 . As 
explained in section 45, the spatial coordinates x J of each clock never change and 
the frame is therefore said to be co-moving with the gas. 

Viewed from any one of the clocks, the cosmos is isotropic, i.e. it is impossible 
to specify any direction having special properties. Let g h be the metric tensor in 
this x-frame. Suppose we carry out a spatial coordinate transformation by 
relabelling the clocks, leaving their readings unchanged; such a transformation 
will take the form 

x* = /*(x l , x 2 , x 3 ), x 4 = x 4 (61.1) 

The transformed metric tensor is g tj and we shall have 

Px* Px J cx^ 

= ( 61 - 2 ) 
ex ex ‘ cx a ' 

This equation shows that behaves as a covariant 3-vcclor with respect to 
spatial coordinate transformations and hence determines a special direction at 
every point of 3-space. This is contrary to our assumption of isotropy for galactic 
observers and we conclude that c/, 4 = 0 throughout space-time. Thus, 

d-s 2 = s^dx’dx'’ + </ 44 (dx 4 ) 2 (61.3) 

Now consider the events of a coordinate clock indicating the times x 4 , 
x 4 + dx 4 . Let dr be the proper time interval between these events. Since the 
spatial coordinates of the clock never change, the metric (61.3) shows that 

— c 2 di 2 = y 44 (dx 4 ) 2 (61.4) 

This equation determines the relationship between the cosmical time x 4 and the 
standard time r shown on an atomic clock moving with the galactic observer. But 
this relationship must be independent of the galactic observer, since all are 
equivalent, and it follows that # 44 can only depend on x 4 . We can accordingly 
transform from x 4 to a new cosmical standard time i by a transformation 

c( = j V( -&U 4 )dx 4 (61.5) 

so that the metric (61.3) reduces to 

ds 2 = g tfi dx“dx 0 -c 2 dt 2 (61.6) 

Taking a section r = constant of space-time at a particular cosmical time, an 
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jf 3 with metric 


d.s 2 = (y ; , ^ dx : 'd.x■ ,, (61.7) 

is obtained. This is our model for the cosmos at this cosmical instant. By the 
cosmological principle, any galactic observer will find this Jt } to be homogeneous 
and isotropic. Choosing himself as pole, he will therefore be able to detine 
coordinates {a. 0, cj>) for which the metric (6 1 .7) takes the form (60. 1 3). Using this 
frame and the new eosmical time i, the metric (61.6) finally assumes the 
Robertson-Walker form 


d.s 


2 _ 


dcr 


, — =■ + a 2 (d(I 2 + sin 2 Od0 2 ) 
ktr 


■ c 2 dr 2 


(61.8) 


S is a constant for the cosmos at any given time t, but will in general vary with t; it 
will be referred to as the cosmic scale factor ; only when k = 1 can cosmic space be 
pictured as a hypersphere of radius S in <? 4 . 

It remains to check that the frame of reference is co-moving, as assumed at the 
outset. The galaxies will be falling freely in the gravitational field associated with 
the metric (61.8) and their world-lines must therefore be geodesics. Since we are 
supposing the spatial coordinates of a galaxy remain constant, x* = constant 
along a galactic world-line. Substituting in the geodesic equations (43.5), the 
condition they are satisfied is found to be HU = 0. For the metric (61.8), this 
condition reduces to the requirement r# 44 /rx* = 0, which is clearly true. 


62. Hubble's constant and the deceleration parameter 

The behaviour of the cosmic model derived from the Robertson-Walker metric 
will be determined when the value of k and the dependence of the cosmic scale 
factor S on the cosmical time t are known. From the physical data available today 
(1981), neither of these pieces of information can be derived with any degree of 
accuracy. It seems likely (see section 65) that k = + 1 and that the universe is 
closed, i.e. of finite volume. In regard to S(t), the value of its first derivative is 
known roughly, but even the sign of its second derivative is in doubt, although the 
general consensus of opinion is that it is negative, i.e. the cosmic expansion is 
slowing down. Instead of quoting values of these two derivatives, it is more 
convenient to work with the parameters 

H = $/S (62.1) 

q=-SS’$ 2 (62.2) 

H is called Hubble’s constant and has reciprocal time dimension; q is called the 
deceleration parameter and is dimensionless. In the present epoch, the value of 
l/H is often quoted to be about 1.8 x 10'° years, whereas the value of q is thought 
by some cosmologists to be about unity, although others would not exclude 
negative values. 
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At a fixed cosmical time r, the Robertson-Walker metric requires that ordinary 
space should have the metric (60. 1 3). Any galaxy can be thought of as being placed 
at the pole a = 0 and then the radial (or proper) distance of any other galaxy 
(a, 6, 4>) is given by 

,6131 

where a. = sin ‘ 1 a if k = 1, a. = a if k = 0, and a = sinh 1 a if k = — 1. Thus, the 
rate of recession of this galaxy from the galaxy at the origin is given by 

d = ct$ = Hd (62.4) 

This is Hubble's law that, at a given cosmical time, the rate of recession of any 
galaxy is proportional to its distance. H has the same value for all galactic 
observers at the time r, but will, in general, itself vary with t. Clearly, this law 
cannot be verified directly, since neither the distance d nor its rate of change are 
directly observable; in the next two sections, we shall derive an alternative 
relationship between associated quantities which can be measured. 


63. Red shift of galaxies 

Suppose that a galaxy G (<r , , 6 , , 0 , ) is being observed through a telescope f rom 
the pole O. Successive crests of a light wave emitted by Gat times t l( t, + dr, are 
received at O at times t 0 , f 0 + df 0 respectively. The world-line of each crest is a 
radial null geodesic along which 0 and 0 remain constant (the reader should check 
that the equations of a null geodesic can be satisfied with 9 and <t> constant). The 
Robertson-Walker metric shows that along such a world-line. 


da 

- ka 2 ) 



(63.1) 


Integration of this equation for the motion of each crest yields the equations 
f” 1 da f'°dr f'° + d, <'df 

Jo J (1 —ka 2 ) ~ C S ~J Ii+d „ S (63 ' ) 


7(1 —ka 2 ) 

Since dt 0 , dr, will be small, the last equation implies that 

dr 0 _ dr, 

S(to) S(r,) 


(63.3) 


dr , is the period of the emitted wave as measured by a standard clock at G, and dr 0 
is the period of the received wave as measured by a similar clock at O. Since 
wavelength is proportional to period, if 2 0 , A, are the wavelengths of the received 
and emitted light respectively. 




So/S, 


where S„ = S(r 0 ) and S, = S(r,). 


(63.4) 
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Thus, if the received light is redder than the emitted light, A 0 = /, + A/., , where 
A/.! is positive. The red-shift factor z is defined by the equation 


z 


A/-i 

'i 


/ -o 

-'■t 



(63.5) 


Clearly z is positive if S 0 > S,, i.e. the universe is expanding. 

If the observed galaxy is not too distant, t 0 - t , will be relatively small and we 
can expand S)^) in a Taylor expansion thus: 

S(C) = 5 (to) (to — t])$(to) + i(to — 1 1 ) 2 S (to) + . • • 

= S(to){l — W 0 (t 0 — r i ) it/o^o(to — t,) 2 +. . .} (63.6) 

where H 0 , q 0 are the values of Hubble’s constant and the deceleration parameter 
at the instant t 0 of observation. Substituting from the last equation in (63.5), we 
derive the result 


z — H 0 [t 0 — ti) + (i<)o+ l)Wo(to — t t ) 2 + . . . (63.7) 

By observing z for a number of galaxies and calculating (t 0 — t,) for each, this 
expansion provides a means of estimating the values of H and q at the present 
epoch. The calculation of (t 0 -f t ) is considered in the next section. 

64. Lnminosity distance 

If all galaxies possessed the same intrinsic luminosity, i.e. emitted light energy at 
the same rate, and if this luminosity were independent of the time, the observed or 
apparent luminosities of galaxies would depend upon their distances according to 
a calculable formula and an observation of the apparent luminosity would then 
provide us with a measure of the distance. Although there is considerable 
variation in the strengths of the galaxies as light sources, by confining 
observations to galaxies of a particular type and stage of evolution, this variation 
can be reduced and thus estimates of their distances can be obtained. Since 
galaxies tend to occur in clusters, once the distance of one member of a cluster has 
been found, the intrinsic luminosity of the other members can be determined, thus 
providing further useful information in regard to the probable intrinsic 
brightness of galaxies of other types; this information can then be utilized as a 
basis for later distance determinations. 

Suppose we take the pole O of coordinates (ct. 6 , 0 ) at some distant galaxy G 
whose luminosity is to be observed and let our point of observation A have 
coordinates (<x, jtt, 0). Photons emitted by G will have null geodesics as world- 
lines, along which 8 and 0 will be constant. Consider a photon which travels along 
the ray 9 = ^tr, 0 = £, where e is very small. This photon will ultimately arrive at 
the point (<r, ^7t, e) of closest approach to A when its distance from A will be given 
by equation (60.13) to be 


ds = S 0 cr£ 


(64.1) 
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where S 0 is calculated at the time of arrival t 0 of the photon in the vicinity of A. It 
now follows that, if the telescope at A has aperture of radius a , the photon will be 
collected by the telescope provided e < a/S 0 a. Thus, the telescope will collect all 
photons which left G along paths enclosed within a right circular cone of semi- 
vertical angle a/S 0 a. This cone embraces a solid angle 


2k (1 — cose) = ftc 2 = . 


Sr, o' 


S O ° 2 ' 


(64.2) 


where a = na 2 is the telescope’s aperture; it follows that the proportion of 
photons leaving G which are collected at A is ot/[4nSlo 2 ). 

According to equation (63.4), if v>, is the frequency of a photon when it leaves G 
at time f ,, its frequency v 0 on arrival at A at time t 0 is given by r 0 = S, V[/S 0 . But 
the energy of a photon is related to its frequency by the formula E = hv, where h is 
Planck’s constant. Thus, the energy of the photon is also reduced by a factor 
SJS 0 . Further, equation (63.3) indicates that the photons emitted from G over a 
time interval df i arrive at A over the longer time interval dt 0 = S 0 dr, /S,. Hence, 
the rate of reception of light energy is additionally reduced by a factor SJS 0 . If, 
therefore, L is the intrinsic luminosity of G, i.e. the total rate at which it emits 
radiation, the rate at which light energy is collected by the telescope at A is given 

by 

a S, S[ LaS 2 ^ , , 

tx iSp*s 0 *s,, = wx 11 


The apparent luminosity l of a celestial object is defined to be the rate at which 
light energy from the object flows across unit area normal to the line of sight at the 
point of observation. The last result shows that for G, 

/= LS 2 l /(4no 2 S* 0 ) (64.4) 


If space were Euclidean and G were stationary at a distance d from A, we should 


have 


/ = L/(4nd 2 ) 


(64.5) 


Substituting the actual value of / from (64.4) and solving for d, we find 


d = oSl/Si 


(64.6) 


This result is termed the luminosity distance of G. 

Returning to equation (64.4), we shall obtain an expansion for / in powers of 
(t 0 — (,). Equation (63.2) shows that 


do p" df 

o >/(! ~^ 2 ) J ,, S' 


(64.7) 


Assuming a is relatively small (i.e. G is not too distant), the integrand of the left- 
hand member of this equation can be expanded in powers of a to give 


i; 




V (1 -ko 2 ) 


o + -ko 3 

D 


(64.8) 



(64.9) 


Also, equation (63.6) leads to the result 


1 

S(i) 


1 


{ 1 + W 0 (t 0 —/) + .. . 


} 


Hence, the right-hand member of equation (64.7) can be expanded in the form 
c 77 = ■=- { ('o — *i) + 1 W 0 (t 0 — t, ) 2 4- . . . } (64.10) 

i, * 

Equating the expansions (64.8) and (64. 10), we see that, to the first order of small 
quantities, a = c(t 0 -t,)/S 0 . It follows that, to the second order in (r 0 — 1 1 ), 


° = -^-{(io-h) + iH 0 (t 0 -t l ) 2 + ...) (64.11) 

Substitution from equations (63.6) and (64.1 1) in equation (64.4) now yields the 
expansion 

, - wi' 0 -,,>i 1| - 3H ’ |, »- , ' l+ - I |64,2 » 

Finally, equation (63.7) shows that to the first order (f 0 - 1,) = z/H 0 . It follows 
from the same equation, therefore, that 

Ho('o-'i) = 2 ~ (i<?o + !)z 2 + . . . (64.13) 

Substituting for (r 0 — t,) from this equation into equation (64.12) accordingly 
leads to the expansion 

I LI 2 

1 = 7 — ri{ 1+ (‘?o- 1 )z+ • ■ ■} (64.14) 

4;rc z 

The importance of this last equation is that it relates two observable quantities / 
and z. By fitting it as closely as possible to the available data, estimates of H 0 and 
q 0 have been obtained. A more precise relationship follows from dynamical 
considerations (see Exercises 7, No. 5). 


65. Cosmic dynamics 

To gain further information in regard to the probable values of the functions H (t) 
and q(t) at cosmical times in the remote past and future, it is necessary to 
determine the equations of motion of the cosmos. These will be provided by 
Einstein’s equations of gravitation. It is therefore necessary first to calculate the 
energy- momentum tensor for the galactic gas. 

The 4-velocity of a particle whose world-line in the x-frame has equations 
x 1 = x‘(t) (t = s/ic being proper time measured by a standard clock moving with 
the particle) has been defined (section 50) to be the contra variant vector V given 

by 
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If the frame is inertial and Minkowski coordinates are used, this definition is in 
agreement with that adopted in the special theory (section 15). 

Now consider a perfect fluid. At any point in the fluid, we can construct a freely 
falling frame which is locally inertial and in which Minkowski coordinates can be 
defined. The special theory is valid in any such frame and the fluid’s proper 
density of proper mass p 00 and pressure p can be defined as 4-invariants (section 
22). The energy-momentum tensor in the frame then follows from equation 
(22.21). In the general x-frame, this tensor must therefore be given by the equation 

T ij =( f i 00 + P/c 2 )V‘V i + P9 iJ (65.2) 

for this is a tensor equation and reduces to the valid equation (22.21) in the freely 
falling Minkowski frame. 

Along a world-line of a particle of the galactic gas, the coordinates (cr, 6 , <p) 
remain constant and hence, from the metric equation (61.8), we deduce that t = t. 
Thus, the 4-velocity of this particle has components ( V') = (0,0,0, 1) in the 
Robertson- Walker frame. The non-zero contra variant components of the metric 
tensor are 

g" = (1 -ka 2 )/S 2 , g 22 = l/S 2 o 2 , g 33 = cosec 2 0/S 2 <r 2 , </ 44 = -1/c 2 

(65.3) 

We can now calculate the non-zero components of the energy-momentum tensor 
for the galactic gas from equation (65.2); they are 


T" = pi, 1 -ko 2 )/S 2 
T 22 - p/S 2 cr 2 
T 33 = p cosec J 0/S 2 <r 2 
r 44 =p 


(65.4) 


where we have deleted the subscripts in p 00 for convenience. The covariant 
components now follow immediately, viz. 


T u = pS 2 /(l — ka 2 ) 
T 22 =pS 2 o 2 
r 33 = pS 2 a 2 sin 2 0 
r 44 = /tc 4 


(65.5) 


The non-zero components of the Ricci tensor for the Robertson-Walker 
metric may be verified to be 

R u = -P/( 1 -kc 2 ), R 22 = - Pa 2 , R 33 = — Pa 2 sin 2 0, R 44 = 3 S/S 

(65.6) 

where P = 2k + (SS + 2S 2 )/c 2 (65.7) 

dots denoting differentiations with respect to t. 
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The reader may also verify that the curvature scalar is given by 

R = R\ = - 6 (SS + S 2 + kc 2 )/c 2 S 2 (65.8) 

We can now construct Einstein’s equations (47.16) (covariant form); only two 
distinct equations emerge, viz. 

2SS + S 2 +kc 2 -c 2 \S 2 = -Kc 2 pS 2 (65.9) 

3(S 2 +kc 2 )-c 2 \S 2 = xc*pS 2 (65.10) 

Together with an equation of state p = p{p), these equations are sufficient to 
determine the unknown functions S, p and p. 

Before attempting to integrate these equations, certain important conclusions 
can be reached by a direct study. 

Equation (65.10) can be written in the form 

3c 2 

~~2 k — kc*p — (3/f 2 — c 2 A) (65.11) 

from which it follows that k = + 1 if 

p > p c = (3H 2 -c 2 A)/kc 4 (65.12) 

and k = - 1 if the reverse inequality is true. p c is therefore a critical density (the 
closure density) for the galactic gas, determining whether the universe is of finite 
or infinite volume. Assuming A = 0 and taking H = 1.8 x 10~ 18 (at the present 
epoch), kc 4 = 8trG = 1.67 x 10~ 9 (all SI units), we calculate that 

p c = 6 x 10~ 27 kgm -3 (65.13) 

If the matter in the galaxies were spread uniformly over the whole of space at the 
present epoch, it is estimated that its density would be p = 3 x 10 _28 kgm' 3 . 
Thus p < p c and this datum suggests that the universe is open. However, the 
presence of a very tenuous intergalactic dust or gas, far below the limit of possible 
observation, could easily reverse this conclusion. We accordingly seek further 
information from equation (65.9). 

This equation can be written 

c 2 

- 1 k = (2q-\)H 2 +c 2 \-Kc 2 p (65.14) 

Again taking A = 0 and assuming p to be negligible, we conclude that k = + 1 if 
q > i- Unfortunately, the red shift-luminosity relationship, from which the value 
of q is derivable (see section 64), has not been established with sufficient certainty 
to decide for or against this inequality. However, the data available tends to 
support it. 
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66. Model universes of Einstein and de Sitter 

The first solution of the dynamical equations (65.9) and (65.10) was suggested by 
Einstein himself. His proposal was made some years before Hubble published his 
observations relating to the recession of the galaxies, and the possibility that the 
universe was in a state of expansion was not considered by astronomers at that 
time. Einstein’s universe was therefore static and the equations were satisfied by 
taking 5 to be constant. Pressure and density had then to satisfy the equations 


* 

T3 

II 

> 

1 

^1*- 

(66.1) 

2 3fc A 

KC P = ^5 - A 

(66.2) 


Clearly, if A = 0, since p and p cannot be negative, the only possible solution is 
for p , p and k all to be zero, i.e. an empty, infinite, Euclidean cosmos. It was in 
order to be able to reject this solution that Einstein introduced the cosmical 
constant term into his equation of gravitation. 

Adding equations (66.1) and (66.2), we find 

= x(p + c 2 p) (66.3) 

proving that k must be positive, i.e. k = + 1 and the universe is closed. Equations 
(66.1) and (66.2) now show that A must satisfy the inequalities 

A ~ (66.4) 

S is the radius of the universe, which is certainly very large, indicating that A will 
be quite inappreciable except for phenomena on the cosmic scale. Following upon 
Hubble’s discovery, Einstein abandoned his model and with it, the cosmical 
constant. Since then, A has been put to zero in most cosmological investigations 
and we shall follow this practice in the remaining sections of this chapter (except 
in certain exercises at the end). 

The necessity for including the cosmical constant term if the universe is to be 
static, follows from very elementary considerations. Without it, an initially static 
cosmos will collapse under the gravitational attractions of its constituent galaxies. 
If A is positive, the term implies the existence of a counterbalancing long-range 
repulsive force between the galaxies, which increases with their distance of 
separation (Exercises 6, No. 40). However, although these two opposing 
gravitational forces can result in equilibrium, putting p = 0 to match the present 
state of the cosmos and hence A = I/S 2 , it may be proved that the equilibrium is 
unstable (Exercises 7, No. 8). 

Following upon Einstein’s proposed model, de Sitter derived another which is 
of some historical interest. In this context, his model is most conveniently 
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constructed by adopting as our basic assumption that Hubble's constant does not 
change with time, i.e. 


S/S — H = constant 

(66.5) 

Integrating, we get 


S = A exp Hi 

(66.6) 

where A is the value of S at some arbitrarily chosen origin of 
Substituting in equations (65.9) and (65.10), we find that 

cosmical time. 

3 H 2 k 

xp= A- 2 jexp(-2Ht) 

(66.7) 

2 3H 2 3k 

kc n= -Ah y + - , exp ( - 2 Hi) 

c A 

(66.8) 

As f — > x , 


Kp -> A - 3 H 2 /c 2 , kc 2 p -* — A + 3 H 2 /c 2 

(66.9) 

Neither of these limits can be negative, so we must require that 


A = 3 H 2 /c 2 

(66.10) 

Thus, equations (66.7) and (66.8) lead to the result 


3 p + c 2 p = 0 

(66.11) 

from which we conclude that 


p = 0, p = 0 

(66.12) 


and, therefore, that k = 0. 

Thus, de Sitter’s universe is Euclidean, but being empty is only of academic 
interest. Although the model appears to be in a state of expansion, since no matter 
is present, this phenomenon has no physical basis; indeed, it is possible to derive a 
static metric for the model by carrying out certain tranformations on a and t (see 
Exercises 7, No. 3). If the galaxies are represented by freely falling particles having 
negligible mass and constant spatial coordinates, an expanding infinite cosmos is 
obtained; however, each galactic observer is limited to a finite universe, since he 
cannot penetrate beyond a certain distance called his event horizon (see Exercises 
7, No. 4). 


67. Friedmann universes 

These models are constructed from equations (65.9) and (65.10) by putting p = 0, 
i.e. by treating the galactic gas as an incoherent dust cloud. We shall also put A to 
zero. 
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We first note that equations (65.11) and (65.14) reduce to the forms 


kcV= 3H 2 + lkc 2 /S 2 

(67.1) 

(2q -\)H 2 = kc 2 JS 2 

It follows that 

(67.2) 

kc x h = 6qH 2 

and hence that q can never be negative. 

Equation (65.9) gives 

(67.3) 

2 SS + S 2 + kc 2 = 0 

Writing this equation in the form 

(67.4) 

-^-(SS 2 )= -kc 2 S 
at 

we can integrate and obtain 

(67.5) 

SS 2 = c 2 {D — kS) 

D being constant. 

Substitution in equation (65.10) now yields 

(67.6) 

tiS 3 = 3 D/kc 2 = constant 

(67.7) 


showing that the constant D must be positive. This is the equation of conservation 
of mass (see Exercises 7, No. 2). In the case of a finite cosmos, the total volume is 
known to be 2n 2 S 3 and the total proper mass is accordingly 2n 2 S 3 fx, which is 
conserved. This is to be expected, since there is no interaction between the 
particles of the dust cloud and their proper masses therefore never change. 

We shall next integrate equation (67.6), with k taking its three possible values, 
separately. 

(i) If k = 1, the equation is 

SS 2 = c 2 (D — S) (67.8) 

Changing the dependent variable from S to u by the transformation 

S = $D(l -cosu) (67.9) 

we have S = j Du sinu and equation (67.8) leads to the equation 

jD(l — cosu)u = c (67.10) 

This integrates immediately to the form 

ct = \ D (u — sinu) (67.11) 

choosing t to be zero when u = 0 (i.e. the origin of cosmical time is taken to be the 
instant when S = 0). Equations (67.9) and (67.1 1) determine Sasa function of f; 
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since these equations are the well-known parametric equations for a cycloid, the 
plot of S against ct must take this form (see Fig. 9). 

Our conclusion is that the finite universe will expand from a singularity at i = 0 
to a maximum radius D when u ~ ttandr = JiD/2c,and will then contract back to 
the singular state at u = 2n, r = nD/c. The actual behaviour in the vicinity of the 
singularity is, of course, not predicted by our analysis, since p has been neglected 
and the contribution of the radiation ignored. 

(ii) If k = 0, equation (67.6) gives 

SS 2 =c 2 D (67.12) 

The variables separate and integration yields 

S 3 = 9c 2 Dt 2 /4 (67.13) 

again taking t = 0 at S = 0. This universe expands from a singularity and 
continues to do so indefinitely (Fig. 9). Space is Euclidean and the cosmos is open. 

(iii) If k = — 1, equation (67.6) takes the form 

SS 2 = c 2 (D + S) (67.14) 

As in the case k = 1 , we change the variable to u, this time by the transformation 

S = jDlcoshu - I ) (67.15) 

and integrate to find that 

cl = jD(sinhu — u) (67.16) 

taking u and / to be zero at S = 0. As in the previous case, the initial expansion is 
never reversed and the universe is open (Fig. 9). 



192 


It remains to calculate two constants of integration, viz. D and the value f 0 of f 
in the present epoch, in terms of H 0 and q 0 . 

Putting present values into equation (67.2), if k ^ 0 we get 


S 2 = 


kc‘ 


° aq 0 -\)H 2 0 

Equation (67.6) can be written in the form 

H 2 = c 2 (D — kS)/S 3 

Thus. 

D = ~S i 0 H 2 +kS 0 


(67.17) 


(67.18) 

(67.19) 


If * 0, equation (67.17) now shows that 


D = j~q 0 (2<lo -l) ' 3 2 . 9o > h k = 1 
”0 

= TT 4o<* — 2^o) “ 3 2 .0 <q 0 <isk = - 1. 
"o 


(67.20) 


If k = 0, equation (67.2) requires that q = j and equation (67.19) gives 

D = SqHo/c 2 (67.21) 


S 0 , H 0 are independent parameters for this model. However, S 0 is superfluous and 
may be eliminated by transformation of the coordinate a to a' = S 0 o; S 0 then 
disappears from the metric. Equivalently, we may take S 0 = 1 and write equation 
(67.13) in the final form 

4S 2 = 9Hlt 2 (67.22) 


The present age t 0 of the universe is now calculable for the three types of model, 
(i) If k = 1, q 0 > j, equation (67.9) shows that 

cos u 0 = 1 — 2 S 0 /D = l (67.23) 

9o 

having used equations (67.17) and (67.20). Equation (67. 1 1 ) now yields the result 


<o = 7T'[9 o(2</o - I) 3 2 cos " 1 (^o 1 — 1 ) — (2^o — 1 ) ‘ J (67.24) 
"o 

the inverse cosine being taken in the first or second quadrant. If values q (l = I. 
1/H 0 = 1.8 x 10‘° years are substituted, we find ( 0 = I0 10 years very nearly, i.e. 
ten billion years. 

(ii) If k = — I, 0 < q 0 < 7, a similar calculation leads to the result 

; o = 77— T ( 1 -2tj 0 r 1 -r/ 0 U — 2t/ 0 ) _ 3 2 cosh “ 1 (<7o ‘ -•)] (67.25) 

“0 
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Suppose we calculate q 0 from equation (67.3), taking /i = 3x 10“ 28 kgm~ 1 (i.e. 
assuming all the matter has been attracted into the galaxies) and H 0 = 1.8 
x 10' 18 s“‘.Then(j 0 = 0.025 and the formula (67.24) gives t 0 = 1.6 x 10'° years 
(16 billion years). 

(iii) Finally, if k = 0, q = putting S = 1 into equation (67.22) gives 

t 0 = 2/{3H 0 ) (67.26) 

With 1/7/ 0 = 1.8 x 10 10 years, this makes the present age of the cosmos to be 
12 billion years. 

For the early stage of cosmic expansion, the model’s failure to include the 
contribution of the radiation renders it invalid. However, since this stage is 
expected to be short (about 10 6 years), the corrections which need to be made to 
the values of t 0 calculated above to allow for this failure are negligible. 

The stage of radiation dominance is studied in the next section. 

68. Radiation model 

During the early stages of cosmic expansion, it is believed that the dominating 
factor was the electromagnetic radiation. In this section, we shall study a 
simplified cosmological model, containing radiation alone, which will provide a 
first approximation to this early state of the universe. 

We shall assume that the radiation is isotropic for each galactic observer and 
has energy density U (the same for all observers at a cosmical time t). Such an 
observer is in a state of free fall and can be equipped with rectangular axes and 
associated standard clocks, which behave locally like an inertial frame. 
Minkowski coordinates y' can be defined in this frame and, with respect to these, 
the observer’s 4-velocity is V = (0, it) (section 15). 

In this y-frame, the results of the special theory are applicable and, in particular, 
equation (29.5) defines the energy-momentum tensor for the radiation. We 
shall assume that the time variations of the field components £ a , //„ of this 
radiation are quite random and that there is no correlation between these 
components. Thus, if ot =/=/?, 

m(£ 1 £^) = 0, m(H.H^) = 0 (68.1) 

where m(Q ) denotes the mean value. Also, since the radiation is isotropic, m(E* ) 
and m{H \ ) will be independent of a. Hence, taking mean values in equation 
(29.16), we find 

e 0 m(El) + n 0 m(Hl) = ^U (68.2) 

U being the mean energy density of the radiation. 

We can now calculate the mean values of the components of S, ; . If a =/=/?, it 
follows from equations (29.14) and (68.1) that m(S lff ) = 0. Also, since the 
radiation is in a steady state, the rate of energy flow in any direction is zero and. 
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hence, the mean values of the components of the Poynting vector all vanish; i.e. 
m(S j4 ) = 0. It is clear, therefore, that only the means of the diagonal components 
S, ,, S 2 2 , S 3J , S 44 are non-zero and. for these, equations (29.13| and (29.16) give 

m(S,, ) = m(S 22 ) = m(S,,) = }C 0 m(£ 2 ) + ^ 0 m ( j L' (68.3) 

m(S 44 ) = —U (68.4) 

A tensor equation for S‘ J . valid in any frame, can now be written down, viz. 

S ' 1 = 4 -. V'V<+ 1 g‘>V (68.5) 

3 c 3' 


where V' is the 4-velocity of the local galactic observer. U is uniquely defined and 
is thus a 4-invariant. Hence, this equation certainly defines a contravariant tensor. 
The equation is easily verified to give the mean components just calculated in 
the y-frame. It is therefore valid in all frames. If the equation is compared with 
equation (65.2) for a perfect fluid, it will be seen that the radiation behaves like a 
perfect fluid of density V/c 2 and pressure (7/3. 

In the Robertson Walker frame, ( V') = (0, 0, 0, 1). Hence, the non-zero 
components of the energy-momentum tensor in this frame are: 


S“ = 1/(1 -ka 2 )/3S 2 
S 22 = U/(3S 2 ff 2 ) 

S 33 = U cosec 2 0/(S 2 a 2 ) 

S 44 = Vic 2 


( 68 . 6 ) 


Using the components of the Ricci tensor already calculated in section 65, the 
Einstein equations for the model can now be calculated. They prove to be 

2SS + S 2 + kc 2 -c 2 AS 2 = - \kc 2 US 2 (68.7) 

3(S 2 +kc 2 )—c 2 AS 2 = kc 2 US 2 (68.8) 


Since S will be comparatively small during this phase, even if A is non-zero, the 
terms containing this constant will be negligible. We accordingly put A = 0. Also, 
if fc j= 0, the term kc 2 will be negligible by comparison with the terms 2SS and S 2 . 
For. during the matter-dominated phase, equation (67.6) shows that 



and fee 2 is small by comparison with S 2 provided D/S > 1. But 


(68.9) 


D _ D S 0 _ 2 q 0 S 0 

S“So S'"|2 90 -1|"S 


( 68 . 10 ) 


having used equations (67.17) and (67.20). At the beginning of this phase, S 0 /S is 
large and it follows that D/S is large. Thus, kc 2 is negligible by comparison with 
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S 2 and, since S takes even larger values during the earlier radiation dominated 
phase, we shall neglect kc 2 during this phase. We have proved, therefore, that in 
this early stage of the cosmic expansion, the behaviour of the cosmos is 
independent of the values of A and k. 

Eliminating U between equations (68.7) and (68.8), it is now found that 

SS + S 2 = - d -(SS) = 0 (68.11) 

dr 

Two integrations now yield the results 

S = A/S, S 2 = 2At (68.12) 

where A is constant. Substituting in equation (68.8), we find 

kc 2 U — 3S 2 /S 2 = 3/(4r 2 ) (68.13) 

If we assume that during this phase the radiation is in thermal equilibrium with 
the matter present, then it will possess a black-body spectrum and the 
temperature T will be given in terms of the energy density U by Stefan's law, viz. 

U=aT 4 (68.14) 

where a = 7.5 x 10 16 J m 3 K 4 is the Stefan- Boltzmann constant. Thus, 
equation (68.13) leads to the following formula for T: 

( V 2 V 4 

T= - r 1 2 = 1.52r - 1 2 x 10 10 K (68.15) 

\32nGa ) 

One second after the inception of the expansion, this formula indicates a 
temperature of 1.52 x 10 10 K for the cosmos. 


69. Particle and event horizons 


Equation (64.7) determines the coordinate a of an object which is observed in our 
telescopes at the present epoch r 0 , by light which was emitted by the object at time 
1 1 . Since r, cannot be less than the time i = 0 at which the cosmic expansion 
commenced (accepting the big-bang hypothesis), the most distant object which 
we can observe today has coordinate a given by 


r = P“ 

Jo 7(1 -ko 2 ) C Jo 

ich an 

/„ = S 0 j” 


df 

~S 


The proper distance of such an object is therefore 

da 


7d -ko 2 ) 


i 

a 

J o 


c I dl 

CSn I ~ 


(69.1) 


(69.2) 


d H is said to be the proper distance of the panicle horizon. 

In the case of the Friedmann model with k = + 1, equations (67.9) and (67.1 1) 
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give 


and, thus, 


d( jD(l -cos u)du 

c —■ - = — , = die 

S iD(l -cosu) 


(69.3) 


die — So u o — Tp(2flo — 1) 
H u 

by equations (67.17) and (67.23). 

The reader is left to obtain the results 




(69.4) 


2c 

Ho' 


k = 0 


(69.5) 


- 1 2 cos [j 1 ( 


d« - — (1 - 2qo ) 
Ho 

if k = - 1 


<?o 


1 . 


(69.6) 


for the other values of k, in a similar manner. 

In particular, if k = + 1, q 0 = 1, l/H 0 = 1.8 x 10 10 years, we calculate that 
d H = 2.8 x 10 10 light years. 

Equation (64.7) can be utilized in an alternative manner. If an event occurs at 
the point with coordinate a at the present epoch r 0 , we shall observe it at time t, 
provided 


’ d cr [*'•• dt 

o >/(l-*<r 2 )“ C J l4 S 


(69.7) 


In the case of the Friedmann model with k = + 1, we must have t, < nD/c, since 
the cosmos collapses to a point at lime nD/cJork = Oor - l,i , can be arbitrarily 
large. Thus, the proper distance in the present epoch of the most distant event we 
can ever hope to see is given by 


dt — <S 0 



(69.8) 


where t mal = nD/c if k = +1, and t m>1 = x if k = 0 or — 1. If d, = x, then all 
events happening in the present epoch will ultimately be observed in our 
telescopes, d t is termed the proper distance of the event horizon. 

In the case k = + 1, we calculate that 


dt = S 0 (u ma> -«u) = S 0 (27T - u 0 ) 

= ^-(2flo-l)"' - 2 cos-* f — — 1 ) (69.9) 

Ho \q o ) 

provided the inverse cosine is taken in the range (zr, 2zr). With q 0 = 1,1 /H 0 = 1.8 
x 10 10 years, it will be found that d, : = 8.4 x 10 10 light years. Light from events 
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occurring at a greater proper distance wiil not have reached us before the cosmos 
collapses to a singularity. 

The cases A: = 0 and — 1 both give an infinite value of d t and all events 
happening today will, in these models, ultimately be observable on the earth. 


Exercises 7 

1. Transforming to a new radial coordinate r by the equation a = r/( 1 -+• 3 Acr 2 ), 
obtain the Robertson-Walker metric in the form 

ds 2 = (- — 7 — 7 ^ {dr 2 + r 2 (dd 2 + sin 2 8d<j> 2 )} -c 2 dt 2 
\1 + j fcr / 

2. From the conservation equation 7 ' 4 ‘ , = 0, obtain the equation 

d , 3 , . 

— (pS i ) + - i S 2 Sp = 0 

df c 

Deduce that, if p = 0, then p oc 1/S 3 . 

3. Show that the de Sitter metric 

d.s 2 = /( 2 exp(2W7’)(dff 2 + a 2 d8 2 + a 2 sin 2 9d<p 2 ) - c 2 di 2 
can be transformed to the static form 
dr 2 

ds 2 = - — + r 2 (df ? 2 + sin 2 Odtp 2 ) - c ^ 2 | 1 -H 2 r 2 c 2 )dT 2 

1 


by the transformation equations 


a = 


r exp( — HT) 
A J(l~H T r ~ 


2 >c 2 )' 


" t +Th ' ogl 


' rK I 


4. A photon is emitted from the point (a, fl, 0) along the radius to the origin at 
time t in the de Sitter universe whose metric is given in the previous exercise. Show 
that the time taken to reach the origin is 


H 


log 


. ha 

1 aexp(tfr) 

( 


Hence show that if the proper distance of the point from the origin at the time r is 
greater than c H, the photon will never arrive at O. 

5. Show that, for all the Friedmann models with A = 0, p = 0. the luminosity 
distance d L of a galaxy whose red shift is r is given by 


d L = 77— 2 UoZ + (</o — 1) [ v (2</ 0 r + 1 ) — 1 }]. 
n 0 <?o 

If z is small, verify the expansion (64.14). 



198 


6. Show that if A is not assumed to vanish in the Friedmann model, then S{t) 
satisfies 

SS 2 = c 2 (D -kS + 3 AS 3 ) 

where D is a matter density parameter defined by the equation Kt 2 /iS 3 = 3D. 
Show that the special case k = 0, D = 0 yields the de Sitter universe. 

7. Sketch the graph of SS 2 against S for the Friedmann model with cosmical 

constant (previous exercise) and deduce that (i) S increases from zero to a 
maximum value and then decreases back to zero in the cases: (a) k = + 1, A 
< 4/(9 D 2 ), (b) k = 0, A < 0, (c) k = — 1, A < 0; (ii) S increases steadily from 
zero and tends to infinity in the cases: (a) k = + 1, 

A > 4/(9D 2 ), (b) k = 0, A ^ 0, (c) k = — 1, A ^ 0. In case (i) (a), if A is positive, 
show that there is also a solution in which S decreases to a non-zero minimum and 
thereafter steadily increases towards infinity. 

8. lf/c= + 1 and A = 4/(9D 2 ), show that the radius S of the Friedmann model 
can first increase from zero to a value 1/,/A, when the cosmos attains the static 
Einstein state (section 66) and then, if slightly disturbed from this state, S may 
either decrease back to zero or continue to increase indefinitely. (This shows that 
the static Einstein universe is unstable.) 

9. A cosmos containing radiation, but no matter, is governed by the equations 
(68.7) and (68.8). Show that 

S 2 S 2 = c 2 (D-kS 2 + \\S*) 

where D is an energy density parameter defined by the equation 3D = xUS*. 

10. Sketch the graph of S 2 S 2 against S for the universe described in the 
previous exercise and deduce that all the conclusions listed in exercise 7 are valid if 
4/(9 D 2 ) is replaced by 3/(4D). 

11. For the universe described in exercise 9, if k = 1,A = 3/(4D),andS = Oat; 
= 0, prove that at any later time i, 

S 2 = 2D{ 1 — exp( — cl/ J D)} 

If S = yJ(2D) at t = 0, prove that the universe is static but unstable. 
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